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TRANSMITTAL LETTER TO THE UNITED ST^|SAQ£j!>^ ATTORNEY'S DOCKET NUMBER 50915 

U.S. APPLICATION NO. (If known, see 37 CFR 1 .5) 

INTERNATIONAL FILING DATE PRIORITY DATE CLAIMED 
27 July 2000 27 July 1999 

18 November 1999 
22 March 2000 

kND THEIR USE FC 

APPLICANT(S) FOR DO/EO/US Bernhard HAUER, Juergen PLEISS, Ulrich SCHWANEBERG, Jutta SCHMITT, Markus FISCHER 
Rolf SCHMID, Qing-shan LI, Sabine Lutz-WAHL, Daniel APPEL 



I . IXI This is a FIRST submission of items concerning a filing under 35 U.S.C. 371 . 

2.11 This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 . 

3. 1X1 This express request to begin national examination procedures (35 U.S.C.371 (f)) at any time rather than delay examination until 
the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and 39(1). 

A. Ixl A proper Demand for International Preliminary Examination was made by the 1 9th month from the earliest claimed priority date. 

5. 1X1 A copy of the International Application as filed (35 U.S.C. 371 (c)(2)). 

a. /X/ is transmitted herewith (required only if not transmitted by the International Bureau). 

b. / / has been transmitted by the International Bureau. 

c. l I is not required, as the application was filed in the United States Receiving Office (RO/USO). 

6. IXI A translation of the international Application into English (35 U.S.C. 371 (c)(2)). 

7. IX I Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3)). 

a. / X / are transmitted herewith (required only if not transmitted by the International Bureau). 

b. / / have been transmitted by the International Bureau. 

c. l I have not been made; however, the time limit for making such amendments has NOT expired. 

d. / / have not been made and will not be made. 

8. / X / A translation of the amendments to the claims under PCT Article 1 9(35 U.S.C. 371 (c)(3)). 

9. / X / An oath or declaration of the inventor(s)(35 U.S.C. 1 71 (c)(4)). 

10. / / A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 U.S.C. 371 (c)(5)). 
Items 1 1 . to 1 6. below concern other document(s) or information included: 

II. // An Information Disclosure Statement under 37 CFR 1 .97 and 1 .98. 

1 2.1 X / An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included. 

13./X / A FIRST preliminary amendment. 
/ / A SECOND or SUBSEQUENT preliminary amendment. 

1 47 / A substitute specification. 

15. // A change of power of attorney and/or address letter. 

1 6. /x / Other items or information. 

International Search Report 

International Preliminary Examination Report 
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17. /X/ The following fees are submitted 

BASIC NATIONAL FEE (37 CFR 1 .492(a)(1 )-(5)): 

Search Report has been prepared by the 

EPO orJPO $890.00 

International preliminary examination fee paid to USPTO 
(37 CFR 1.482) $710.00 

No international preliminary examination fee paid to 
USPTO (37 CFR 1 .482) but international search fee paid 
to USPTO (37 CFR 1 .445(a)(2)) $740.00 

Neither international preliminary examination fee 

(37 CFR 1 .482) nor international search fee 

(37 CFR 1 .445(a)(2)) paid to USPTO $ 1 ,040.00 

International preliminary examination fee paid to 
USPTO (37 CFR 1 .482) and all claims satisfied pro 
>f PCT Article 33(2)-(4) $1 00.00 



ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS 



PTO USE ONLY 



Surcharge of $1 30.00 for furnishing the oath or declaration 
later than / / 20 / /30 months from the earliest 
claimed priority date (37 CFR 1 .492(e)). 



Total Claims 
Indep.Claims 
Multiple dependent claim(s)(if applicable) 



TOTAL OF ABOVE CALCULATION 



Reduction of 1/2 for filing by small entity, if applicable. 
Verified Small Entity statement must also be filed 
(Note 37 CFR 1 .9, 1.27, 1.28). 



Processing fee of $1 30. for furnishing the English 
translation later than / /20 / /30 months from the 
earliest claimed priority date (37 CFR 1.492(f)). 



TOTAL NATIONAL F 



Fee for recording the enclosed assignment (37 CFR 1 .21 (h)). 
The assignment must be accompanied by an appropriate cover 
sheet (37 CFR 3.28, 3.31 ) $40.00 p 



TOTAL FEES ENCLOSED 



= $ 984.00 

Amount to be 

refunded: $ 



A check in the amount of $ 984.0 0 to cover the above fees is enclosed. 

n the amount of $ 



:o cover the above fees. A duplicate copy of this sheet 



c.tXl The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment to Deposit 
• Account No. 11-0345 . A duplicate copy of this sheetis enclosed. 

NOTE: Where an appropriate time limit under 37 CFR 1 .494 or 1 .495 has not been met, a petition to revive (37 CFR 1 .137(a) or (b) must 
be filed and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO : 
KEIL& WEINKAUF 
1101 Connecticut Ave., N.W. 
Washington. D. C. 20036 



SlgtfATURE 

Herbert B. Keil 

NAME 

Registration No. 18,967 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re the Application of 
HAUER et al. 



BOX PCT 



International Application 
PCT/EP 00/07253 

Filed: July 27, 2000 



For NOVEL CYTOCHROME P450 MONOOXYGENASES AND THEIR USE FOR THE 
OXIDATION OF ORGANIC COMPOUNDS 



Honorable Commissioner of 

Patents and Trademarks 
Washington, D.C. 20231 

Sir: 

Prior to examination, kindly amend the above-identified application as follows: 
IN THE CLAIMS 

Kindly amend the claims as shown on the attached sheets. 



The claims were amended in the preliminary examination. The claims have been 
amended to eliminate multiple dependency and to place them in better form for U.S. filing, 
new matter is included. 

A clean copy of the claims is attached. 

Favorable action is solicited. 



PRELIMINARY AMENDMENT 



REMARKS 



Respectfully submitted, 
KEIL & WEINKAUF 




Herbert B. Keil 
Reg. No. 18,967 



1101 Connecticut Ave., N.W. 
Washington, D.C. 20036 



(202)659-0100 
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CLEAN VERSION OF AMENDED CLAIMS - 50915 

4. A nucleic acid sequence coding for a monooxygenase according to claim 1 . 
11. A process as claimed in claim 9, where the monooxygenase is derived from 
cytochrome P450 monooxygenase BM-3 from Bacillus megaterium having an 
amino acid sequence according to SEQ ID NO:2, which has at least one 
functional mutation in at least one of the amino acid sequence regions 1 72-224, 
39-43. 48-52. 67-70, 330-335, 352-356, 73-82 and 86-88. 
1 3. A process for microbiological oxidation of optionally substituted mono- or 
polynuclear aromatics, straight-chain or branched alkanes or alkenes, or 
optionally substituted cycloalkanes or cycloalkenes, which comprises 
a1 ) culturing the recombinant cytochrome P450-producing microorganism as 
claimed in claim 7 in a culture medium, in the presence of an exogenous 
or intermediately formed substrate; or 
a2) incubating a substrate-containing reaction medium with a cytochrome 
P450 monooxygenase derived from cytochrome P450 monooxygenase 
BM-3 from Bacillus megaterium having an amino acid sequence 
according to SEQ ID NO:2, which has at least one functional mutation in 
at least one of the amino acid sequence regions 172-224, 39-43. 48-52. 
67-70, 330-335, 352-356, 73-82 and 86-88; and 
b) isolating the oxidation product formed or a secondary product thereof 

from the medium; 
where the monooxygenase mutant Phe87Val is not excluded. 
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CLEA/V VERSION OF AMENDED CLAIMS - 50975 

15. (cancel) 

16. A process as claimed in claim 13, where the cytochrome P450 monooxygenase 
has at least one of the following mono- or polyamino acid substitutions: 

a) Phe87Val; 

b) Phe87Val, Leu188Gln; or 

c) Phe87Val, Leu188Gln, Ala74Gly. 

17. A process as claimed in claim 9, wherein, as exogenous substrate, at least one 
compound selected from the groups a) to d) of compounds defined above is 
added to a medium and the oxidation is carried out by enzymatic reaction of the 
substrate-containing medium in the presence of oxygen at a temperature of 
approximately 20 to 40°C and a pH of approximately 6 to 9, where the substrate- 
containing medium additionally contains an approximately 10- to 100-fold molar 
excess of reduction equivalents based on the substrate. 

1 9. A process for the microbiological production of indigo and/or indixubin, which 
comprises 

a1 ) culturing a recombinant microorganism which produces an indole- 
oxidizing cytochrome P450 in a culture medium, in the presence of 
exogenous or intermediately formed indole; or 

a2) incubating an indole-containing reaction medium with an indole-oxidizing 
cytochrome P450 monooxygenase; and 

b) isolating the oxidation product formed or a secondary product thereof 

2 
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CLEAN VERSION OF AMENDED CLAIMS - 50915 

from the medium. 

22. A process as claimed in claim 20 , where the monooxygenase is derived from 
cytochrome P450 monooxygenase BM-3 from Bacillus megaterium having an 
amino acid sequence according to SEQ ID NO:2, which has at least one 
functional mutation in at least one of the amino acid sequence regions 172-224, 
39-43. 48-52. 67-70, 330-335, 352-356, 73-82 and 86-88, including the 
substitution Phe87Val. 

23. A process as claimed in claim 22, where the monooxygenase has at least one of 
the following mono- or polyamino acid substitutions: 

a) Phe87Val; 

b) Phe87Val, Leu188Gln; or 

c) Phe87Val, Leu188Gln, A!a74Gly. 

24. A bioreactor comprising the cytochrom P450 monooxygenase as claimed in 
claim 1 or a recombinant microorganism transformed by a vector comprising an 
expression construct comprising a nucleic acid sequence coding for the 
cytochrom P450 monooxygenase of claim 1 in immobilized form. 

25. (cancel) 

26. (cancel) 
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MARKED VERSION OF AMENDED CLAIMS - 50915 

4. (amended) A nucleic acid sequence coding for a monooxygenase according to 
[one of the preceding claims] claim 1 . 

1 1 . (amended) A process as claimed in claim 9 [or 1 0], where the monooxygenase 
is [a mutant as claimed in any of claims 1 to 3, including the mutant Phe87Val] 
derived from cytochrome P450 monooxygenase BM-3 from Bacillus megaterium 
having an amino acid seguence according to SEQ ID NO:2. which has at least 
one functional mutation in at least one of the amino acid seguence regions 172- 
224. 39-43. 48-52. 67-70, 330-335. 352-356. 73-82 and 86-88 . 

1 3. (amended) A process for microbiological oxidation of [a compound as defined in 
claim 1 b), c) or d)] optionally substituted mono- or polvnuclear aromatics, 
straight-chain or branched alkanes or alkenes. or optionally substituted 
cvcloalkanes or cycloalkenes . which comprises 

a1 ) culturing [a] the recombinant cytochrome P450-producing microorganism 
as claimed in claim 7 [or 8] in a culture medium, in the presence of an 
exogenous or intermediately formed substrate; or 

a2) incubating a substrate-containing reaction medium with a cytochrome 

P450 monooxygenase [as claimed in any of claims 1 to 3] derived from 
cytochrome P450 monooxygenase BM-3 from Bacillus megaterium having 
an amino acid seguence according to SEQ ID NQ:2, which has at least 
one functional mutation in at least one of the amino acid seguence 
regions 172-224. 39-43. 48-52. 67-70. 330-335. 352-356. 73-82 and 86- 



1 
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marked, VERSION OF AMENDED CLAIMS - 50915 

15. (cancel) 

16. (amended) A process as claimed in claim [15] 13, where the [mutant] 
cytochrome P450 monooxvaenase has at least one of the following mono- or 
polyamino acid substitutions: 

a) Phe87Val; 

b) Phe87Val, Leu188Gln; or 

c) Phe87Val, Leu188Gln, Ala74Gly. 

17. (amended) A process as claimed in [any of claims] claim 9 [to 16], wherein, as 
exogenous substrate, at least one compound selected from the groups a) to d) of 
compounds defined above is added to a medium and the oxidation is carried out 
by enzymatic reaction of the substrate-containing medium in the presence of 
oxygen at a temperature of approximately 20 to 40°C and a pH of approximately 
6 to 9, where the substrate-containing medium additionally contains an 
approximately 10- to 100-fold molar excess of reduction equivalents based on 
the substrate. 

19. (amended) A process for the microbiological production of indigo and/or 
indixubin, which comprises 

a1 ) culturing a recombinant microorganism which produces an indole- 
oxidizing cytochrome P450 in a culture medium, in the presence of 
exogenous or intermediately formed indole; or 

a2) incubating an indole-containing reaction medium with an indole-oxidizing 
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MARKED VERSION OF AMENDED CLAIMS - 50915 

a1 ) culturing a recombinant microorganism which produces an indole- 

oxidizing cytochrome P450 in a culture medium, in the presence of 

exogenous or intermediately formed indole; or 
a2) incubating an indole-containing reaction medium with an indole-oxidizing 

cytochrome P450 monooxygenase; and 
b) isolating the oxidation product formed or a secondary product thereof 

from the medium^ 

22. (amended) A process as claimed in claim 20 [or 21], where the monooxygenase 
is [a mutant as claimed in any of claims 1 to 3] derived from cytochrome P450 
monooxygenase BM-3 from Bacillus megaterium having an amino acid sequence 
according to SEQ ID NO:2. which has at least one functional mutation in at least 
one of the amino acid sequence regions 172-224. 39-43. 48-52. 67-70. 330-335, 
352-356. 73-82 and 86-88. including the [mutant] substitution Phe87Val. 

23. (amended) A process as claimed in claim 22, where the [mutant] 
monooxygenase has at least one of the following mono- or polyamino acid 
substitutions: 

a) Phe87Val; 

b) Phe87Val, Leu188Gln; or 

c) Phe87Val, Leu188G!n, Ala74Gly. 

24. (amended) A bioreactpr comprising [an enzyme] the cytochrom P450 
monooxygenase as claimed in [one of claims] claim 1 [to 3] or a recombinant 
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MARKED VERSION OF AMENDED CLAIMS - 50915 

microorganism [as claimed in one of claims 7 or 8] transformed bv a vector 
comprising an expression construct comprising a nucleic acid sequence coding 
for the cvtochrom P450 monooxvaenase of claim 1 in immobilized form. 

25. (cancel) 

26. (cancel) 



4 
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CLAIMS AS FILED - 50915 

1 . A cytochrome P450 monooxygenase which is capable of at least one of the 
following reactions: 

a) oxidation of optionally substituted N-, O- or S-heterocyclic mono- or 
polynuclear aromatic compounds; 

b) oxidation of optionally substituted mono- or polynuclear aromatics; 

c) oxidation of straight-chain or branched alkanes and alkenes; 

d) oxidation of optionally substituted cycloalkanes and cycloalkenes; 
where the monooxygenase is derived from cytochrome P450 
monooxygenase BM-3 from Bacillus megaterium having an amino acid 
sequence according to SEQ ID NO:2, which has at least one functional 
mutation in at least one of the amino acid sequence regions 172-224, 39- 
43. 48-52. 67-70, 330-335, 352-356, 73-82 and 86-88; except the single 
mutant Phe87Val. 

2. A monooxygenase as claimed in claim 1 , which has at least one functional 
mutation in at least one of the sequence regions 73-82, 86-88 and 172-224. 

3. A monooxygenase as claimed in claim 1 , which has at least one of the following 
mono- or polyamino acid substitutions: 

a) Phe87Val, Leul88Gln; or 

b) Phe87Val, Leu188GIn, Ala74Gly; 

and functional equivalents thereof which are capable of at least one of the 
above oxidation reactions. 



1 
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CLAIMS AS FILED - 50915 

4. A nucleic acid sequence coding for a monooxygenase according to claim 1 . 

5. An expression construct comprising, under the genetic control of 
regulatory nucleic acid sequences, a coding sequence which comprises a 
nucleic acid sequence according to claim 4. 

6. A vector comprising at least one expression construct according to claim 5. 

7. A recombinant microorganism transformed by at least one vector as claimed in 
claim 6. 

8. A microorganism as claimed in claim 7, selected from bacteria of the genus 
Escherichia. 

9. A process for the microbiological oxidation of an N- or S-heterocyclic mono- or 
polynuclear aromatic compound which comprises 

a1 ) culturing a recombinant microorganism which expresses a cytochrome 
P450 monooxygenase of bacterial origin in a culture medium, in the 
presence of an exogenous or intermediately formed substrate; or 

a2) incubating a substrate-containing reaction medium with a cytochrome 
P450 monooxygenase of bacterial origin; and 

b) isolating the oxidation product formed or a secondary product thereof 
from the medium. 

10. A process as claimed in claim 9, wherein the exogenous or intermediately 
formed substrate is selected from optionally substituted - or S-heterocyclic 
mono- or polynuclear aromatic compounds. 
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CLAIMS AS FILED - 50915 

11. A process as claimed in claim 9, where the monooxygenase is derived from 
cytochrome P450 monooxygenase BM-3 from Bacillus megaterium having an 
amino acid sequence according to SEQ ID NO:2, which has at least one 
functional mutation in at least one of the amino acid sequence regions 172-224, 
39-43. 48-52. 67-70, 330-335, 352-356, 73-82 and 86-88. 

12. A process as claimed in claim 1 1 , where the mutant has at least one of the 
following mono- or polyamino acid substitutions: 

a) Phe87Val; 

b) Phe87Val, Leu188Gln; or 

c) Phe87Val, Leu188Gln, Ala74Gly. 

13. A process for microbiological oxidation of optionally substituted mono- or 
polynuclear aromatics, straight-chain or branched alkanes or alkenes, or 
optionally substituted cycloalkanes or cycloalkenes, which comprises 

a1 ) culturing the recombinant cytochrome P450-producing microorganism as 
claimed in claim 7 in a culture medium, in the presence of an exogenous 
or intermediately formed substrate; or 

a2) incubating a substrate-containing reaction medium with a cytochrome 
P450 monooxygenase derived from cytochrome P450 monooxygenase 
BM-3 from Bacillus megaterium having an amino acid sequence 
according to SEQ ID NO:2, which has at least one functional mutation in 
at least one of the amino acid sequence regions 172-224, 39-43. 48-52. 

3 



100311% ... O ,1, .1, 7" ID! E 



CLAIMS AS FILED - 50915 

67-70, 330-335, 352-356, 73-82 and 86-88; and 
b) isolating the oxidation product formed or a secondary product thereof 

from the medium; 
where the monooxygenase mutant Phe87Val is not excluded. 
14. A process as claimed in claim 13, wherein the exogenous or intermediately 
formed substrate is selected from: 

a) optionally substituted mono- or polynuclear aromatics; 

b) straight-chain or branched alkanes and alkenes; 

c) optionally substituted cycloalkanes and cycloalkenes. 

16. A process as claimed in claim 13, where the cytochrome P450 monooxygenase 
has at least one of the following mono- or polyamino acid substitutions: 

a) Phe87Val; 

b) Phe87Val, Leu188Gln; or 

c) Phe87Val, Leu188Gln, Ala74Gly. 

17. A process as claimed in claim 9, wherein, as exogenous substrate, at least one 
compound selected from the groups a) to d) of compounds defined above is 
added to a medium and the oxidation is carried out by enzymatic reaction of the 
substrate-containing medium in the presence of oxygen at a temperature of 
approximately 20 to 40°C and a pH of approximately 6 to 9, where the substrate- 
containing medium additionally contains an approximately 10- to 100-fold molar 
excess of reduction equivalents based on the substrate. 
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CLAIMS AS FILED - 50915 

18. A process as claimed in claim 17, wherein, as exogenous substrate, a compound 
selected from indole, n-hexane, n-octane, n-deoane, n-dodecane, cumene, 1- 
methylindole, a- 6- or v-ionone, acridine, naphthalene, 6-methyl- or 8- : 
methylquinoline, quinoline and quinaldine is employed. 

19. A process for the microbiological production of indigo and/or indixubin, which 1 
comprises 

a1 ) culturing a recombinant microorganism which produces an indole- 

oxidizing cytochrome P450 in a culture medium, in the presence of 

exogenous or intermediately formed indole; or 
a2) incubating an indole-containing reaction medium with an indole-oxidizing 

cytochrome P450 monooxygenase; and 
b) isolating the oxidation product formed or a secondary product thereof 

from the medium. 

20. A process as claimed in claim 19, wherein the indigo and/or indirubin obtained, 
which was produced by oxidation of intermediately formed indole, is isolated 
from the medium. 

21 . A process as claimed in claim 20, wherein the indole oxidation is carried out by 
culturing the microorganisms in the presence of oxygen at a culturing 
temperature of approximately 20 to 40°C and a pH of approximately 6 to 9. 

22. A process as claimed in claim 20, where the monooxygenase is derived from 
cytochrome P450 monooxygenase BM-3 from Bacillus megaterium having an 
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CLAIMS AS FILED - 50915 

amino acid sequence according to SEQ ID NO:2, which has at least one 
functional mutation in at least one of the amino acid sequence regions 172-224, 
39-43. 48-52. 67-70, 330-335, 352-356, 73-82 and 86-88, including the 
substitution Phe87Val. 

23. A process as claimed in claim 22, where the monooxygenase has at least one of 
the following mono- or polyamino acid substitutions: 

a) Phe87Val; 

b) Phe87Val, Leu188G!n; or 

c) Phe87Val, Leu188Gln, Ala74Gly. 

24. A bioreactor comprising the cytochrom P450 monooxygenase as claimed in 
claim 1 or a recombinant microorganism transformed by a vector comprising an 
expression construct comprising a nucleic acid sequence coding for the 
cytochrom P450 monooxygenase of claim 1 in immobilized form. 
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Novel cytochrome P450 monooxvgenases and their use for the 
oxi dation of organ ic compounds 

5 The present invention relates to novel cytochrome P450 

monooxygenases with modified substrate specificity which are 
capable of the oxidation of organic substrates, for example 
N-heterocyclic aromatic compounds, nucleotide sequences coding 
therefor, expression constructs and vectors comprising these 
10 sequences, microorganisms transformed therewith, processes for 

the microbiological oxidation of various organic substrates, such 
as N-heterocyclic aromatic compounds and in particular processes 
for the preparation of indigo and indirubin. 

15 Enzymes having novel functions and properties can be prepared 

either by screening of natural samples or by protein engineering 
of known enzymes. Under certain circumstances, the last-mentioned 
method can be the more suitable to induce characteristics whose 
generation by the natural selection route is improbable. Despite 

20 numerous attempts at the engineering of enzymes, up to now there 
are only a few successful studies for promoting the catalytic 
activity of enzyme mutants with respect to a certain substrate 
(1-10). In these known cases, the substrates are structurally 
closely related to the native substrate of the respective enzyme. 

25 As yet, there are no reports on the successful engineering of 
enzymes which, after modification, catalyze the reaction of a 
compound which structurally is completely different from the 
native substrate of the enzyme. 

30 The cytochrome P450 monooxygenase isolatable from the bacterium 
Bacillus megaterium usually catalyzes the subterminal 
hydroxylation of long-chain, saturated acids and the 
corresponding amides and alcohols thereof or the epoxidation of 
unsaturated long-chain fatty acids or saturated fatty acids of 

35 medium chain length (11-13). The optimal chain length of 

saturated fatty acids is 14 to 16 carbon atoms. Fatty acids 
having a chain length of less than 12 are not hydroxylated (11). 

The structure of the heme domain of P450 BM-3 was determined by 
40 X-ray structural analysis (14-16). The substrate binding site is 
present in the form of a long tunnel-like opening which extends 
from the surface of the molecule as far as the heme molecule and 
is almost exclusively bordered by hydrophobic amino acid 
residues. The only charged residues on the surface of the heme 
45 domain are the residues Arg47 and Tyr51. It is assumed that these 
are involved in the binding of the carboxylate group of the 
substrate by formation of a hydrogen bond (14). The mutation of 
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Arg4 7 to Glu brings about inactivation of the enzyme for 
arachidonic acid (13), but increases its activity compared with 
Ci2-Ci4-alkyltrimethylammonium compounds (17). Substrate 
utilization for aromatic compounds, in particular mono-, bi- or 
5 polynuclear, if desired heterocyclic, aromatics, alkanes, 

alkenes, cycloalkanes and cycloalkenes , has not been described 
for this enzyme. Until now, it was therefore assumed in 
specialist circles that substrates other than the organic 
substrates hitherto described, for example indole, on account of 
10 the clear structural differences from the native substrates of 
P450 BM-3, in particular on account of the absence of functional 
groups which could bind to the abovementioned residues in the 
substrate pocket, are not a substrate. 

15 It is an object of the present invention to make available novel 
cytochrome P450 monooxygenases having modified substrate 
specificity or modified substrate profile. In particular, 
monooxygenase mutants are to be provided which, in comparison 
with the nonmutated wild- type enzyme, are enzymatically active 

20 with structurally clearly different substrates. 

Compared to the wild-type enzyme, a "modified substrate profile" 
can be observed for the mutants according to the invention. In 
particular, for the mutant in question, an improvement in 

25 reactivity is observed, for example an increase of the specific 
activity (expressed as nmol of converted substrate /minute /nmol of 
P450 enzyme) and/or of at least one kinetic parameter selected 
from the group consisting of Kcat, Km and Kcat/Km (for example by 
at least 1%, such as 10 to 1000%, 10 to 500% or 10 to 100%) in 

30 the conversion of at least one of the oxidizable compounds 

defined in groups a) to d) . The oxidation reaction according to 
the invention comprises the enzyme-catalyzed oxygenation of at 
least one exogenous (i.e. added to the reaction medium) or 
endogenous (i.e. already present in the reaction medium) organic 

35 substrate. In particular, the oxidation reaction according to the 
invention comprises a mono- and/or polyhydroxylation, for example 
a mono- and/or dihydroxylation, at an aliphatic or aromatic C-H 
group, or an epoxidation at a C=C group which is preferably 
non-aromatic. Also possible are combinations of the above 

40 reactions. Moreover, the immediate reaction product can be 

converted further in the context of a non-enzymatic subsequent or 
side reaction. Such combinations of enzymatic and non-enzymatic 
processes likewise form part of the subject-matter of the present 
invention . 
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We have found that the above object is surprisingly achieved by 
means of novel cytochrome P450 monooxygenases which, for example, 
are capable of the oxidation of N-heterocyclic bi- or polynuclear 
aromatic compounds . 

5 

In particular, the invention relates to those monooxygenases 
whose substrate-binding region is capable by means of 
site-specific mutagenesis of the functional uptake of novel, for 
example N-heterocyclic substrates. 

10 

In a preferred embodiment of the invention, the novel 
monooxygenases are soluble, i.e. existent in non membrane-bound 
form, and enzymatically active in this form. 

15 The monooxygenases according to the invention are preferably 

derived from cytochrome P450 monooxygenases of bacterial origin, 
as derived, in particular, from cytochrome P4 50 monooxygenase 
BM-3 from Bacillus megaterlum having an amino acid sequence 
according to SEQ ID NO: 2, which has at least one functional 

20 mutation, i.e. promoting the oxidation of novel organic 

substrates (cf. in particular the groups a) to d) of compounds as 
defined below), for example N-heterocyclic mono-, bi- or 
polynuclear aromatic compounds, in one of the amino acid sequence 
regions 172-224 (F/G loop region), 39-43 (13-strand 1), 48-52 

25 (B-strand 2), 67-70 (B-strand 3), 330-335 (B-strand 5), 352-356 
(B-strand 8), 73-82 (helix 5) and 86-88 (helix 6). 

The cytochrome P450 monooxygenase mutants provided according to 
the invention are preferably capable of at least one of the 
30 following reactions: 

a) oxidation of unsubstituted or substituted N-, O- or 
S-heterocyclic mono-, bi- or polynuclear aromatic compounds; 

b) oxidation of unsubstituted or substituted mono- or 
35 polynuclear aromatics; 

c) oxidation of straight-chain or branched alkanes and alkenes; 
and 

d) oxidation of unsubstituted or substituted cycloalkanes and 
cycloalkenes . 

40 

Preferred monooxygenase mutants have at least one functional 
mutation, in particular amino acid substitution, in at least one 
of the sequence regions 73-82, 86-88 and 172-224. Thus, for 
example, Phe87 can be replaced by an amino acid having an 
45 aliphatic side chain, such as Ala, Val, Leu, in particular Val; 
Leul88 can be replaced by an amino acid having an amide side 
chain, such as Asn or, in particular, Gin; and Ala74 can be 
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replaced by another amino acid having an aliphatic side chain, 
such as Val and, in particular, Gly. 

Particularly preferred monooxygenase mutants of this type are 
5 those which have at least one of the following mono- or polyamino 
acid substitutions: 

1) Phe87Val; 

2) Phe87Val, Leul88Gln; or 

10 3) Phe87Val, Leul88Gln, Ala74Gly; 

and functional equivalents thereof. The number indicates the 
position of the mutation; the original amino acid is indicated 
before the number and the newly introduced amino acid after the 
15 number. 

In this context, "functional equivalents" or analogs of the 
mutants which are disclosed specifically are mutants differing 
therefrom which furthermore have the desired substrate 
20 specificity with respect to at least one of the oxidation 
reactions a) to d) described above, i.e., for example, for 
heterocyclic aromatics and which hydroxylate, for example, 
indole, or furthermore exhibit the desired "modified substrate 
profile" with respect to the wild-type enzyme. 

25 

"Functional equivalents" are also to be understood as meaning in 
accordance with the invention mutants which exhibit, in at least 
one of the abovementioned sequence positions, an amino acid 
substitution other than the one mentioned specifically, but still 

30 lead to a mutant which, like the mutant which has been mentioned 
specifically, show a "modified substrate profile" with respect to 
the wild-type enzyme and catalyze at least one of the 
abovementioned oxidation reactions. Functional equivalence exists 
in particular also in the case where the modifications in the 

35 substrate profile correspond qualitatively, i.e. where, for 
example, the same substrates are converted, but at different 
rates . 

"Functional equivalents" naturally also encompass P450 
40 monooxygenase mutants which, like the P450 BM3 mutants which have 
been mentioned specifically, can be obtained by mutating P450 
enzymes from other organisms. For example, regions of homologous 
sequence regions can be identified by sequence comparison. 
Following the principles of what has been set out specifically in 
45 the invention, the modern methods of molecular modeling then 
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allow equivalent mutations to be carried out which affect the 
reaction pattern. 

"Functional equivalents" also encompass the mutants which can be 
5 obtained by one or more additional amino acid additions, 

substitutions, deletions and/or inversions, it being possible for 
the abovementioned additional modifications to occur in any 
sequence position as long as they give rise to a mutant with a 
modified substrate profile in the above sense. 

10 

Substrates of group a) which can be oxidized according to the 
invention are unsubstituted or substituted heterocyclic mono-, 
bi- or polynuclear aromatic compounds; in particular oxidizable 
or hydroxylatable N-, O- or S-heterocyclic mono-, bi- or 

15 polynuclear aromatic compounds. They include preferably two or 
three, in particular two, 4- to 7-membered, in particular 6- or 
5-membered, fused rings, where at least one, preferably all, 
rings have aromatic character and where at least one of the 
aromatic rings carries one to three, preferably one, N-, O- or 

20 S-heteroatom in the ring. The total ring structure may contain 
one or two further identical or different heteroatoms . The 
aromatic compounds may furthermore carry 1 to 5 substituents at 
the ring carbon or heteroatoms . Examples of suitable substituents 
are Cx- to C 4 -alkyl, such as methyl, ethyl, n- or isopropyl, n-, 

25 iso- or t-butyl, or C 2 - to C 4 -alkenyl, such as ethenyl, 

1-propenyl, 2-propenyl, 1-butenyl, 2-butenyl or 3-butenyl, 
hydroxyl and halogen, such as F, CI and Br. The alkyl or alkenyl 
substituents mentioned may also have a keto or aldehyde group; 
examples being propan-2-on-3-yl, butan-2-on-4-yl , 

30 3-buten-2-on-4-yl . Non-limiting examples of suitable heterocyclic 
substrates are, in particular, binuclear heterocycles , such as 
indole, N-methyl-indole , and the substituted analogs thereof 
which carry one to three of the above-defined substituents on 
carbon atoms, for example 5-chloro- or 5-bromoindole ; and also 

35 quinoline and quinoline derivatives, for example 

8-methylquinoline, 6-methyl-quinoline and quinaldine; and 
benzothiophene , and the substituted analogs thereof which carry 
one to three of the above-defined substituents on carbon atoms. 
Moreover, trinuclear hetero-aromatics , such as acridine and the 

40 substituted analogs thereof which carry one to three of the 
above-defined substituents on carbon atoms, may be mentioned. 

Substrates of group b) which are oxidizable according to the 
invention are unsubstituted or substituted mono- or polynuclear, 
45 in particular mono- or binuclear, aromatics, such as benzene and 
naphthalene. The aromatic compounds may be unsubstituted or mono- 
or polysubstituted and, for example, carry 1 to 5 substituents on 
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the ring carbon atoms. Examples of suitable substituents are Ci— 
to C 4 -alkyl, such as methyl, ethyl, n- or isopropyl or n-, iso- or 
t-butyl, or C2- to Cj-alkenyl, such as ethenyl, 1-propenyl, 
2-propenyl, 1-butenyl, 2-butenyl or 3-butenyl, hydroxyl and 
5 halogen, such as F, CI and Br. The alkyl or alkenyl substituents 
mentioned may also have a keto or aldehyde group; Examples being 
propan-2-on-3-yl, butan-2-on-4-yl , 3-buten-2-on-4-yl . The 
aromatic may be fused with a four- to seven-membered non-aromatic 
ring. The non-aromatic ring may have one or two C=C double bonds, 

10 be mono- or polysubstituted by the abovementioned substituents 
and may carry one or two hetero ring atoms . Examples of 
particularly suitable aromatics are mononuclear aromatics, such 
as cumene, and binuclear substrates, such as indene and 
naphthalene, and substituted analogs thereof which carry one to 

15 three of the above-defined substituents on carbon atoms. 

Substrates of group c) which can be oxidized according to the 
invention are straight-chain or branched alkanes or alkenes 
having 4 to 15, preferably 6 to 12, carbon atoms. Examples which 

20 may be mentioned are n-butane, n-pentane, n-hexane, n-heptane, 
n-octane, n-nonane, n-decane, n-undecane and n-dodecane, and the 
analogs of these compounds which are branched once or more than 
once, for example analogous compounds having 1 to 3 methyl side 
groups; or mono- or polyunsaturated, for example 

25 mono-unsaturated, analogs of the abovementioned alkanes. 

Substrates of group d) which can be oxidized according to the 
invention are unsubstituted or substituted cycloalkanes and 
cycloalkenes having 4 to 8 ring carbon atoms . Examples of these 

30 are cyclopentane, cyclopentene , cyclohexane, cyclohexene, 

cycloheptane and cycloheptene . The ring structure may carry one 
or more, for example 1 to 5 , substituents according to the above 
definition for compounds of groups a) and b) . Nonlimiting 
examples are ionones, such as a-, (3- and y-ionone, and the 

35 corresponding methyl ionones and iso-methyl ionones . Particular 
preference is given to a- and (3- ionone . 

The invention also relates to nucleic acid sequences coding for 
one of the monooxygenases according to the invention. Preferred 

40 nucleic acid sequences are derived from SEQ ID NO:l, which have 
at least one nucleotide substitution which leads to one of the 
functional amino acid mutations described above. The invention 
moreover relates to functional analogs of the nucleic acids 
obtained by addition, substitution, insertion and/or deletion of 

45 individual or multiple nucleotides, which furthermore code for a 
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monooxygenase having the desired substrate specificity, for 
example having indole-oxidizing activity. 

The invention also encompasses those nucleic acid sequences which 
5 comprise so-called silent mutations or which are modified in 
comparison with a specifically mentioned sequence in accordance 
with the codon usage of a specific origin or host organism, and 
naturally occurring variants of such nucleic acid sequences. The 
invention also encompasses modifications of the nucleic acid 

10 sequences obtained by degeneration of the genetic code (i.e. 

without any changes in the corresponding amino acid sequence) or 
conservative nucleotide substitution (i.e. the corresponding 
amino acid is replaced by another amino acid of the same charge, 
size, polarity and/or solubility), and sequences modified by 

15 nucleotide addition, insertion, inversion or deletion, which 
sequences encode a monooxygenase according to the invention 
having a "modified substrate profile", and the corresponding 
complementary sequences. 

20 The invention furthermore relates to expression constructs 

comprising a nucleic acid sequence encoding a mutant according to 
the invention under the genetic control of regulatory nucleic 
acid sequences ; and vectors comprising at least one of these 
expression constructs. 

25 

Preferably, the constructs according to the invention encompass a 
promoter 5 ' -upstream of the encoding sequence in question and a 
terminator sequence 3 ' -downstream, and, optionally, further 
customary regulatory elements, and, in each case operatively 

30 linked with the encoding sequence. Operative linkage is to be 
understood as meaning the sequential arrangement of promoter, 
encoding sequence, terminator and, if appropriate, other 
regulatory elements in such a manner that each of the regulatory 
elements can fulfill its intended function on expression of the 

35 encoding sequence. Examples of operatively linkable sequences are 
targeting sequences, or else translation enhancers, enhancers, 
polyadenylation signals and the like. Further regulatory elements 
encompass selectable markers, amplification signals, replication 
origins and the like. 

40 

In addition to the artificial regulatory sequences, the natural 
regulatory sequence can still be present upstream of the actual 
structural gene. If desired, this natural regulation may be 
switched off by genetic modification, and the expression of the 
45 genes may be enhanced or lowered. However, the gene construct may 
also be simpler in construction, i.e. no additional regulatory 
signals are inserted upstream of the structural gene and the 
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natural promoter with its regulation is not removed. Instead, the 
natural regulatory sequence is mutated in such a way that 
regulation no longer takes place and the gene expression is 
increased or reduced. One or more copies of the nucleic acid 
5 sequences may be present in the gene construct. 

Examples of suitable promoters are: cos, tac, trp, tet, trp-tet, 
lpp, lac, lpp-lac, laclq, T7 , T5 , T3 , gal, trc, ara, SP6, 1-PR or 
1-PL promoter, which are advantageously employed in Gram-negative 

10 bacteria; and Gram-positive promoters amy and SP02, the yeast 

promoters ADC1, MFa, Ac, P-60, CYC1, GAPDH or the plant promoters 
CaMV/35S, SSU, OCS, lib4 , usp, STLS1, B33, nos or the ubiquitin 
or phaseolin promoter. Particular preference is given to using 
inducible promoters, for example light- and in particular 

15 temperature-inducible promoters, such as the P r Pi promoter. 

In principle, all natural promoters with their regulatory 
sequences can be used. In addition, synthetic promoters may also 
be used in an advantageous fashion. 

20 

The abovementioned regulatory sequences are intended to allow the 
targeted expression of the nucleic acid sequences and of protein 
expression. Depending on the host organism, this may mean, for 
example, that the gene is expressed or overexpressed only after 
25 induction has taken place, or that it is expressed and/or 
overexpressed immediately. 

The regulatory sequences or factors can preferably have a 
positive effect on expression and in this manner increase or 
30 reduce the latter. Thus, an enhancement of the regulatory 

elements may advantageously take place at the transcriptional 
level by using strong transcription signals such as promoters 
and/or "enhancers". In addition, translation may also be enhanced 
by improving, for example, mRNA stability. 

35 

An expression cassette is generated by fusing a suitable promoter 
with a suitable monooxygenase nucleotide sequence and a 
terminator signal or polyadenylation signal. To this end, 
customary recombination and cloning techniques are used as they 

40 are described, for example, in T. Maniatis, E.F. Fritsch and 

J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1989) and in 
T.J. Silhavy, M.L. Berman and L.W. Enquist, Experiments with Gene 
Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 

45 (1984) and in Ausubel, F.M. et al., Current Protocols in 
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Molecular Biology, Greene Publishing Assoc. and Wiley 
Interscience (1987). 

For expression in a suitable host organism, the recombinant 
5 nucleic acid construct or gene construct is advantageously 

inserted into a host-specific vector which allows optimal gene 
expression in the host. Vectors are well known to the skilled 
worker and can be found, for example, in "Cloning Vectors" 
(Pouwels P.H. et al., Ed., Elsevier, Amsterdam-New York-Oxford, 

10 1985). Vectors are to be understood as meaning not only plasmids, 
but all other vectors known to the skilled worker such as, for 
example, phages, viruses, such as SV4 0, CMV, baculovirus and 
adenovirus, transposons, IS elements, phasmids , cosmids , and 
linear or circular DNA. These vectors can be replicated 

15 autonomously in the host organism or chromosomal ly. 

The vectors according to the invention allow the generation of 
recombinant microorganisms which are transformed, for example, 
with at least one vector according to the invention and which can 

20 be employed for producing the mutants. The above-described 
recombinant constructs according to the invention are 
advantageously introduced into a suitable host system and 
expressed. It is preferred to use usual cloning and transfection 
methods known to the skilled worker in order to bring about 

25 expression of the abovementioned nucleic acids in the expression 
system in question. Suitable systems are described, for example, 
in current protocols in molecular biology, F. Ausubel et al., 
Ed., Wiley Interscience, New York 1997. 

30 Suitable host organisms are, in principle, all organisms which 
allow expression of the nucleic acids according to the invention, 
their allelic variants, and their functional equivalents or 
derivatives. Host organisms are to be understood as meaning, for 
example, bacteria, fungi, yeasts or plant or animal cells. 

35 Preferred organisms are bacteria such as those of the genera 
Escherichia, such as, for example, Escherichia coli, 
Streptomyces , Bacillus or Pseudomonas, eukaryotic microorganisms 
such as Saccharomyces cerevisiae, Aspergillus, and higher 
eukaryotic cells from animals or plants, for example Sf9 or CHO 

40 cells. 

If desired, expression of the gene product may also be brought 
about in transgenic organisms such as transgenic animals such as, 
in particular, mice, sheep, or transgenic plants. The transgenic 
45 organisms may also be knock-out animals or plants in which the 
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corresponding endogenous gene has been eliminated, such as, for 
example, by mutation or partial or complete deletion. 

Successfully transformed organisms can be selected by marker 
5 genes which are likewise contained in the vector or in the 

expression cassette. Examples of such marker genes are genes for 
resistance to antibiotics and for enzymes which catalyze a color 
reaction, which causes staining of the transformed cell. These 
transformed cells can then be selected using automatic cell 

10 selection. Microorganisms which have been transformed 

successfully with a vector and which carry an appropriate gene 
for resistance to antibiotics (for example G418 or hygromycin) 
can be selected by using appropriate antibiotics-containing media 
or substrates. Marker proteins which are presented on the cell 

15 surface can be used for selection by affinity chromatography. 

The combination of the host organisms and the vectors appropriate 
for the organisms, such as plasmids, viruses or phages, such as, 
for example, plasmids with the RNA polymerase/promoter system, 
20 phages X, |j. or other temperate phages or transposons and/or other 
advantageous regulatory sequences forms an expression system. The 
term "expression system" means, for example, a combination of 
mammalian cells such as CHO cells, and vectors, such as pcDNA3neo 
vector, which are suitable for mammalian cells. 

25 

As described above, the gene product can also be expressed 
advantageously in transgenic animals, for example mice, sheep, or 
transgenic plants. It is likewise possible to program cell-free 
translation systems with the RNA derived from the nucleic acid. 

30 

The invention furthermore provides a process for preparing a 
monooxygenase according to the invention, which comprises 
cultivating a monooxygenase-producing microorganism, if 
appropriate inducing the expression of the monooxygenase, and 
35 isolating the monooxygenase from the culture. If desired, the 
monooxygenase according to the invention can thus also be 
produced on an industrial scale. 

The microorganism can be cultivated and fermented by known 
40 methods. Bacteria, for example, can be grown in a TB or LB medium 
and at 20-40°C and a pH of 6-9. Suitable cultivation conditions 
are described in detail in T. Maniatis, E.F. Fritsch and 
J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1989), for example. 
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If the monooxygenase is not secreted into the culture medium, the 
cells are then lyzed and the monooxygenase is obtained from the 
lysate using known methods for the isolation of proteins. The 
cells can be lyzed alternatively by high-frequency ultrasound, by 
5 high pressure, for example in a French pressure cell, by 

osmolysis, by the action of detergents, lytic enzymes or organic 
solvents, by homogenization or by a combination of a plurality of 
the processes mentioned. Purification of the monooxygenase can be 
achieved by known chromatographic processes, such as molecular 

10 sieve chromatography (gel filtration), such as Q-Sepharose 
chromatography, ion-exchange chromatography and hydrophobic 
chromatography, and by other customary processes, such as 
ultrafiltration, crystallization, salting out, dialysis and 
native gel electrophoresis. Suitable processes are described, for 

15 example, in Cooper, F.G., Biochemische Arbeitsmethoden 

[Biochemical Procedures], Verlag Walter de Gruyter, Berlin, New 
York or in Scopes, R. , Protein Purification, Springer Verlag, New 
York, Heidelberg, Berlin. 

20 To isolate the recombinant protein, it is particularly 

advantageous to use vector systems or oligonucleotides which 
extend the cDNA by certain nucleotide sequences and thus code for 
modified polypeptides or fusion proteins which serve to simplify 
purification. Suitable modifications of this type are, for 

25 example, so-called "tags" which act as anchors, such as, for 
example, the modification known as hexa-histidine anchor, or 
epitopes which can be recognized as antigens by antibodies 
(described, for example, in Harlow, E. and Lane, D., 1988, 
Antibodies: A Laboratory Manual. Cold Spring Harbor (N.Y. ) 

30 Press). These anchors can be used to attach the proteins to a 
solid support such as, for example, a polymer matrix, which can, 
for example, be packed into a chromatography column, or to a 
microtiter plate or to another support. 

35 These anchors can also at the same time be used to recognize the 
proteins, it is also possible to use for recognition of the 
proteins conventional markers such as fluorescent dyes, enzyme 
markers which form a detectable reaction product after reaction 
with a substrate, or radioactive markers, alone or in combination 

40 with the anchors for derivatizing the proteins. 

The invention moreover relates to a process for the 
microbiological oxidation of organic compounds, for example 
N-heterocyclic mono-, bi- or polynuclear aromatic compounds 
45 according to the above definition, which comprises 
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al) culturing a recombinant microorganism according to the above 
definition in a culture medium, in the presence of an 
exogenous (added) substrate or an intermediately formed 
substrate, which substrate is oxidizable by the monooxygenase 
5 according to the invention, preferably in the presence of 

oxygen (i.e. aerobically) ; or 

a2 ) incubating a substrate-containing reaction medium with an 

enzyme according to the invention, preferably in the presence 
of oxygen and an electron donor; and 
10 b) isolating the oxidation product formed or a secondary product 
thereof from the medium. 

The oxygen required for the reaction either passes from the 
atmosphere into the reaction medium or, if required, can be added 
15 in a manner known per se. 

The oxidizable substrate is preferably selected from 

a) unsubstituted or substituted N-heterocyclic mono-, bi- or 
20 polynuclear aromatic compounds; 

b) unsubstituted or substituted mono- or polynuclear aromatics; 

c) straight-chain or branched alkanes and alkenes; 

d) unsubstituted or substituted cycloalkanes and cycloalkenes . 

25 A preferred process variant is directed to the formation of 
indigo/indirubin and is characterized by the fact that the 
substrate is indole formed as an intermediate in the culture and 
that the indigo and/or indirubin formed in the culture medium is 
isolated by oxidation of hydroxyindole intermediates . 

30 

If the oxidation according to the invention is carried out using 
a recombinant microorganism, the culturing of the microorganisms 
is preferably first carried out in the presence of oxygen and in 
a complex medium, such as, for example, TB or LB medium at a 

35 culturing temperature of approximately 20 to 4 0°C and a pH of 

approximately 6 to 9 , until an adequate cell density is reached. 
The addition of exogenous indole is usually not necessary, as 
this is intermediately formed by the microorganism. However, when 
using other substrates, addition of exogenous substrate may be 

40 required. In order to be able to control the oxidation reaction 
better, the use of an inducible, in particular temperature- 
inducible, promoter is preferred. The temperature is in this case 
increased to the necessary induction temperature, e.g. 42°C in the 
case of the P r Pi promoter, this is maintained for a sufficient 

45 period of time, e.g. 1 to 10 or 5 to 6 hours, for the expression 
of the monooxygenase activity and the temperature is then reduced 
again to a value of approximately 30 to 40°C. The culturing is 
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then continued in the presence of oxygen for 12 hours to 3 days. 
The pH can, in particular in the case of indole oxidation, be 
increased by addition of NaOH, e.g. to 9 to 10, whereby the 
indigo formation or indirubin formation is additionally promoted 
5 by atmospheric oxidation of the enzymatically formed oxidation 
products 2- and 3-hydroxyindole . 

The indigo/indirubin formation according to the invention is 
illustrated by the reaction scheme below: 

10 




indole 



H 



15 



P450 BM-3 
mutant 



20 




IO 




Air 

oxidation 



dimerizatioif^ 



H 



H 



25 



I 



II 



indigo 




30 



Air 

oxidation 
dimerization 



35 




indirubin 



I: 2 -hydroxy indole (oxindole) 
II: 3-hydroxyindole (indoxyl) 



40 



45 However, if the oxidation according to the invention is carried 
out using purified or enriched enzyme mutants, the enzyme 
according to the invention is dissolved in an exogenous 
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substrate-containing, for example indole-containing medium 
(approximately 0.01 to 10 mM, or 0.05 to 5 mM) , and the reaction 
is carried out, preferably in the presence of oxygen, at a 
temperature of approximately 10 to 50°C, such as, for example, 30 
5 to 40°C, and a pH of approximately 6 to 9 (such as established, 
for example, using 100 to 200 mM phosphate or tris buffer) , and 
in the presence of a reductant, the substrate-containing medium 
moreover containing, relative to the substrate to be oxidized, an 
approximately 1- to 100-fold or 10- to 100-fold molar excess of 
10 reduction eguivalents. The preferred reductant is NADPH. If 
required, the reducing agent can be added in portions. 

In a similar manner, the oxidizable substrates which are 
preferably used are: n-hexane, n-octane, n-decane, n-dodecane, 
15 cumene, 1-methylindole , 5-C1- or Br-indole, indene, 

benzothiophene , a — , |3— and y — ionone, acridine, naphthalene, 
6-methyl- or 8-methylquinoline , quinoline and quinaldine . 

The enzymatic oxidation reaction according to the invention can 
20 be carried out, for example, under the following conditions: 



Substrate concentration: 
Enzyme concentration: 

25 

Reaction temperature: 
P H: 
30 Buffer: 

Electron donor: 

35 



from 0.01 to 2 0 mM 
from 0.1 to 10 mg/ml 
from 10 to 50°C 
from 6 to 8 

from 0.05 to 0.2 M potassium 
phosphate, or Tris/HCl 

is preferably added in portions 
(initial concentration about 
0.1 to 2 mg/ml) 



The mixture can briefly (from 1 to 5 minutes) be preincubated (at 
about 20-40°C) before the reaction is initiated, for example by 
adding the electron donors (e.g. NADPH). The reaction is carried 
40 out aerobically, if appropriate with additional introduction of 
oxygen . 

In the substrate oxidation process according to the invention, 
oxygen which is present in or added to the reaction medium is 
45 cleaved reductively by the enzyme. The required reduction 
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equivalents are provided by the added reducing agent (electron 
donor) . 

The oxidation product formed can then be separated off from the 
5 medium and purified in a conventional manner, such as, for 
example, by extraction or chromatography. 

Further subjects of the invention relate to bioreactors, 
comprising an enzyme according to the invention or a recombinant 
10 microorganism according to the invention in immobilized form. 

A last subject of the invention relates to the use of a 
cytochrome P450 monooxygenase according to the invention or of a 
vector or microorganism according to the invention for the 
15 microbiological oxidation of a substrate from one of the groups 
a) to d) , in particular of N-heterocyclic mono-, bi- or 
polynuclear aromatic compounds, and preferably for the formation 
of indigo and/or indirubin. 

20 The present invention is now described in greater detail with 
reference to the following examples. 

Example 1 : 

25 Randomization of specific codons of P450 BM-3 

The experiments were carried out essentially as described in 
(19). Three positions (Phe87, Leul88 and Ala74) were randomized 
with the aid of site-specific mutagenesis using the Stratagene 
30 QuikChange kit (La Jolla, CA, USA) . The following PCR primers 
were used for the individual positions: 

Phe87: 5 ' -gcaggagacgggttgnnnacaagctggacg-3 ' (SEQ ID NO:3), 
5 ' -cgtccagcttgtnnncaacccgtctcctgc-3 ' , (SEQ ID NO: 4) 
35 Leul88: 5 ' -gaagcaatgaacaagnnncagcgagcaaatccag-3 ' (SEQ ID NO:5), 
5 ' -ctggatttgctcgctgnnncttgttcattgcttc-3 ' (SEQ ID NO: 6); 
Ala74: 5 ' -gctttgataaaaacttaaagtcaannncttaaatttgtacg-3 ' (SEQ ID: 
NO : 7 ) , 

5 ' -cgtacaaatttaagnnnttgacttaagtttttatcaaagc-3 ' (SEQ ID 
40 NO: 8) 

The conditions for the PCR were identical for all three 
positions. In particular, 17.5 pmol of one of each primer, 
20 pmol of template plasmid DNA, 3 U of the Pfu polymerase and 
45 3.2 5 nmol of each dNTP were used per 50 u.1 reaction volume. The 
PCR reaction was started at 94°C/1 min and the following 
temperature cycle was then carried out 20 times: 94°C, 1 min; 
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46°C, 2.5 min; 72°C, 17 min. After 20 cycles, the reaction was 
continued at 7 2°C for 15 min. After the PCR, the template DNA was 
digested at 37°C for 3 h using 20 U of Dpnl . E. coli DH5a was then 
transformed. The transformed E. coll DH5a cells were plated out 
5 onto LB agar plates which contained 150 [xg/ml of ampicillin. 
Incubation was then carried out at 37°C for 18 h. 

Example 2 : 

Expression and purification of the P450 BM-3 and its mutants and 
10 production of a blue pigment 

The P450 BM-3 gene and the mutants thereof were expressed under 
the control of the strong, temperature-inducible PrPl promoter of 
the plasmid pCYTEXPl in E. coll DH5cc as already described (20). 

15 Colonies were picked up using sterile toothpicks and transferred 
to microtiter plates having 96 hollows, comprising 200 ^il of TB 
medium and 100 |j.g/ml of ampicillin per hollow. Incubation was 
then carried out at 37°C overnight. 40 ^1 of the cell culture of 
one of each hollow were then transferred to a culture tube which 

20 contained 2 ml of TB medium with 100 ug/ml of ampicillin. 

Culturing was then carried out at 37°C for 2 h. The temperature 
was then increased to 42°C for 6 h for induction. Culturing was 
then continued at 37°C overnight, a blue pigment being produced. 

25 The preparative production of enzyme or blue pigment was carried 
out starting from a 300 ml cell culture (OD5 78nm = 0.8 to 1.0). 
For the isolation of the enzyme, the cells were centrifuged off 
at 4000 rpm for 10 min and resuspended in 0.1 M K x P0 4 buffer, pH 
7.4. The ice-cooled cells were carefully disrupted with the aid 

30 of a Branson sonifer W25 ( Dietzenbach, Germany) at an energy 

output of 80 W by 2 min sonification three times. The suspensions 
were centrifuged at 32570 x g for 2 0 min. The crude extract was 
employed for the activity determination or for the enzyme 
purification. The enzyme purification was carried out as already 

35 described in (21), to which reference is expressly made hereby. 
The concentration of purified enzyme was determined by means of 
the extinction difference at 450 and 490 nm, as already described 
in (11), using an extinction coefficient e of 91 mM -1 cm -1 . 

40 Example 3 : 

Isolation of mutants which produce large amounts of blue pigment 

100 colonies in each case were isolated from the mutants of one 
45 of each position, which were produced by randomized mutagenesis 
of the codon of the corresponding position. These colonies were 
cultured in culture tubes for the production of blue pigment. 
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After washing the cells with water and a number of slow 
centrif ugation steps (500 rpm) , the blue pigment was extracted 
using dimethyl sulfoxide (DMSO) . The solubility of the blue 
pigment was greatest in DMSO. The absorption of the extract was 
5 determined at 6 77 nm. That mutant which produced the largest 
amount of blue pigment, especially mutants from a specific 
position, was used for DNA sequencing (ABI DNA sequencing kit; 
ABI Prism™ 3 77 DNA sequencer) and moreover as a template for 
site-specific randomized mutagenesis. 

10 

Example 4 : 

Activity test for the indole hydroxylation 

15 The indole hydroxylation activity was tested in a solution which 
contained 8 u.1 of a 10-500 mM indole solution in DMSO, 850 ^1 of 
tris/HCl buffer (0.1 M, pH 8.2) and 0.6 nmol of P450 BM-3 wild 
type or mutant in a final volume of 1 ml. The mixture was 
preincubated for 9 min before the reaction was started by 

20 addition of 50 n-1 of an aqueous 1 mM solution of NADPH. The 

reaction was stopped after 20 sec by addition of 60 jaI of 1.2 M 
KOH. Within 5 to 30 sec (under aerobic conditions), the enzyme 
products were converted completely into indigo 
[A 2 ' 2 ' -biindoline ] -3 , 3 ' -dione ) and indirubin 

25 ( [A 2 ' 3 ' -biindoline] -2 ' ,3-dione) . The indigo production was 

determined by means of its absorption at 67 0 nm. A calibration 
curve using pure indigo showed an extinction coefficient of 
3.9 mM- 1 cm -1 at this wavelength. A linear curve was obtained for 
indigo production in a reaction time of 40 sec using 0.6 nmol of 

30 wild type or P4 50 BM-3 mutant and 0.05 to 5.0 mM of indole. 

Indirubin shows a very weak absorption at 6 70 nm and the amount 
of indirubin formed was very much smaller than the amount of 
indigo formed. The formation of indirubin was neglected in the 
determination of the kinetic parameters . The NADPH consumption 

35 was determined at 340 nm and calculated as described (17) using 
an extinction coefficient of 6.2 mM -1 cm -1 . 

Example 5 : 

40 Purification of indigo and indirubin 

After washing the cells with water and repeated centrif ugation at 
500 g, the blue pellet formed was extracted using tetrahydrof uran 
(THF). The extract was evaporated almost to dryness and the red 
45 pigment was extracted a number of times with 50 ml of absolute 
ethanol . The residual blue solid was dissolved in THF and 
analyzed by thin-layer chromatography (TLC). The ethanol solution 
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was evaporated and purified by silica gel chromatography (TLC 60, 
Merck, Darmstadt, Germany; 2 cm x 30 cm) before it was washed 
with THF and petroleum ether in a ratio of 1:2. The red solution 
obtained was evaporated and its purity was determined by TLC. The 
5 absorption spectra of the blue and of the red pigment were 
determined in a range from 4 00 to 80 0 nm with the aid of an 
Ultraspec 3000 spectrophotometer (Pharmacia, Uppsala, Sweden). 
The blue and the red color were moreover analyzed by mass 
spectrometry and ifi-NMR spectroscopy. 

10 

Experimental results 

1. Increasing the productivity for blue pigment by P450 BM-3 
mutagenesis 

15 

Native P450 BM-3 does not have the ability to produce the blue 
indigo-containing pigment, or the precursor substances 2- or 
3-hydroxyindole . In order to be able to prepare a sufficient 
amount of blue pigment, P450 BM-3 was subjected to evolution in a 

20 controlled manner. All mutants which produced the blue pigment 
were sequenced. It was found that at least one of the following 
three positions were mutated: Phe87, Leul88 and Ala74 . It was 
therefore assumed that these three positions play a crucial role 
for the activity of P450 BM-3 in the production of blue pigment. 

25 From the structure of the heme domain of cytochrome P450 BM-3, 
complexed with palmitoleic acid, it is seen that Phe87 prevents 
the substrate from coming closer to the heme group (14). The 
mutant Phe87Val shows a high regio- and stereoselectivity in the 
epoxidation of (14S, 15R) -arachidonic acid (13) and the mutant 

30 Phe87Ala shifts the hydroxylation position of to-1, co-2 and co-3 to 
co (22). The position 87 was therefore selected as first for the 
site-specific randomized mutagenesis with the aid of PCR. In tube 
cultures, 7 colonies were obtained which produced a small amount 
of blue pigment after induction. The colony which produced the 

35 largest amount of the blue pigment was selected for the DNA 
sequencing. The sequence data showed substitution of Phe87 by 
Val. The mutant Phe8 7Val was then used as a template for the 
second round of site-specific randomized mutagenesis on position 
Leul88. The structure of the heme domain, complexed with 

40 palmitoleic acid, shows that the repositioning of the F and G 
helices brings the residue Leul88 into direct contact with the 
substrate (14). This position can therefore play an important 
role in substrate binding or orientation. After the second 
screening passage, 31 colonies were observed which produced the 

45 blue pigment. The mutant which produced the largest amount of 
pigment contained the substitutions Phe87Val and Leul88Gln. This 
mutant was then mutated in position Ala74 in the third passage of 
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site-specific randomized mutagenesis. In this case the triple 
mutant F87L188A74 (Phe87Val, Leul88Gln and Ala74Gly) was 
obtained, which produced several mg of blue pigment in a 2-liter 
flask, containing 300 ml of TB medium. This amount was sufficient 
5 for the isolation and characterization of the blue pigment. 

2. Isolation and identification of the blue pigment 

After washing the cells, the residual blue pellet was extracted 
10 with THF and analyzed by TLC . The blue pigment was separated into 
a rapidly migrating blue component and into a more slowly 
migrating red component. Both components showed exactly the same 
mobility parameters as the components of a commercial indigo 
sample . 

15 

After the purification, the absorption spectra of both components 
were determined in DMSO. The blue component showed the same 
spectrum as a commercial indigo sample. The purified blue and red 
components were each analyzed by mass spectrometry. The mass 

20 spectra of both pigments showed a strong molecular ion peak at 
m/e = 262 and two fragment peaks at m/e = 234 and 205 (relative 
intensity in each case 10%). This pattern is typical of indigoid 
compounds. The elementary composition of these ions was 
determined by high-resolution mass spectrometry as Ci 6 Hi 0 N 2 O 2 , 

25 C15H10N2O and C14H9N2. This is also characteristic of structures of 
the indigo type. The blue pigment was thus identified as indigo 
and the red pigment as indirubin. For the confirmation of the 
structure, 500 MHz 1 H— NMR spectra of both pigments were carried 
out in DMSO-D 6 solution. The results agreed with the literature 

30 data (23) . 

3. Production of indigo using isolated enzymes 

It is known that indigo is accessible from indole by microbial 
35 transformation (24-26). None of these microbial systems, however, 
contained a P450 monooxygenase . According to the invention, the 
catalytic activity of the pure enzyme for indole was first 
determined. The mutant F87L188A74 was mixed with indole. No color 
reaction could be observed. Only after addition of NADPH to the 
40 reaction mixture was the blue pigment formed after approximately 
20 min. By adjustment of the pH of the reaction mixture to a 
value of approximately 11, 30 sec after addition of NADPH, the 
blue coloration was visible within a few seconds. Control 
experiments using native P450 BM-3 were always negative, even 
45 using increased concentrations of enzyme, indole and NADPH. The 
blue pigment was extracted using ethyl acetate and analyzed by 
TLC. The blue pigment again separated into a more rapidly running 
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blue component and into a slower running red component. The Rf 
values and the absorption spectra were identical to those values 
of the extracts from the fermentation broth. The F87L188A7 4 
mutant of P4 50 BM-3 is thus an indole hydroxylase. 

5 

Two routes have previously been described for the enzymatic 
transformation of indole to indigo. One route is catalyzed by a 
dioxygenase, the other by a styrene monooxygenase (24, 25). The 
NADPH stoichiometry is in both cases 2. It was therefore assumed 
10 that in contrast to the dioxygenases the mutant F87L188A74 
according to the invention hydroxylates indole in only one 
position to form oxindole ( 2-hydroxyindole ) or indoxyl 
( 3-hydroxyindole ) . 

15 4. Kinetic parameters of indole hydroxylation 

Pure samples of the wild-type enzyme P450 BM-3 and of the mutants 
Leul88Gln, Phe87Val, F87L188 and F87L188A74 were used for the 
determination of the kinetic parameters of indole hydroxylation. 
20 The results are summarized in Table 1 below. 



Table 1: Kinetic parameters of the P450 BM-3 mutants for 
indole hydroxylation 



25 



Mutants 


KcaUS- 1 ) 


I 


K m (mM) 




K cat /K m (M-ls-1) 


WT 


-a) 










Leul88Gln 


n.d. b > 




n.d. 




n.d. 


Phe87Val 


2.03 (0. 


.14) 


17.0 (1. 


.0) 


119 


F87L188 


2.28 (0. 


.16) 


4.2 (0. 


•4) 


543 


F87L188A74 


2.73 (0. 


,16) 


2.0 (0. 


.2) 


1365 



a) no activity was observed; 

b) not determined (activity was too low to be measured) 



Even with an excess of purified enzyme and high indole 
concentration, the wild-type enzyme is not able to oxidize 
indole. The mutant Leul88Gln shows a low activity. The mutant 
Phe87Val shows a catalytic activity of 119 M" 1 s- 1 for indole 
hydroxylation. The catalytic efficiency of the double mutant 
F87L188 (Phe87Val,Leul88Gln) increased to 543 M-is-i and was 
increased to 1365 M^s -1 by introduction of the further 
substitution Ala74Gly. The K cat values increased from Phe87Val to 
the triple mutant by a total of 3 5%, while the K m values decreased 
approximately by seven-fold. This indicates that Ala74Gly and 
Leul88Gln are mainly involved in substrate binding. 
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For the triple mutant F87L188A74, the indole turnover rate 
(K ca t=2.73 s- 1 ) was more than ten times higher than for most P450 
enzymes ( 18 ) . 



5 Example 6 



Hydroxylation of n-octane using modified cytochrome P450 
monooxygenase 

10 The reactions were carried out using a P4 5 0 BM-3 monooxygenase 
mutant comprising the following mutations: Phe87Val Leul88Gln 
Ala74Gly 

The chosen substrate was n-octane. For the hydroxylation of 
15 n-octane, the following aerobic reaction mixture was used: 



P450 BM-3 mutant: 17.5 mg ( lyophilisate ) 

Reaction buffer: 9 . 1 ml (potassium phosphate buffer 50 mM, 

P H 7.5) 

20 Substrate: 50 \il of a 60 mM solution (in acetone) 

Temperature: 2 5°C 



The enzyme lyophilisate was dissolved in 500 u.1 of reaction buffer 
and initially incubated at room temperature with substrate and 

25 reaction buffer for 5 minutes. 300 u.1 NADPH solution (5 mg/ml) 
were then added. Addition of NADPH was repeated two more times. 
The progress of the reaction was monitored by measuring the 
absorption at 340 nm, which allows the NADPH decrease to be 
observed. NADPH is added in aliquots of 300 since too high a 

30 concentration of NADPH in the reaction solution would result in 
inactivation of the enzyme. To isolate the products, the reaction 
solution was then extracted three times with 5 ml of diethyl 
ether. The combined organic phases were dried over MgS0 4 and 
concentrated. The products were then characterized by TLC, GC/MS 

35 and NMR . 



The GC/MS analysis of the reaction mixture gave the following 
result : 



40 Compound Rtfmin] 1 ) Conversion [%] 

4-octanol 13.51 37 

3-octanol 14.08 47 

2-octanol 14.26 16 

45 !) Temperature program: 40°C 1 min isothermic / 3°C/min 95°C 
/10°C/min 2 7 5°C; apparatus: Finnigan MAT 95; GC : HP 5890 Series II 
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Split Injector; Column: HP-5MS ( methyls iloxane ) 30m x 0.25mm; 
Carrier gas: 0.065 ml of He/min 

No starting material was found. 

5 

Example 7 : 

Hydroxy lat ion of aromatics, heteroaromatics and trimethylcyclo- 
hexenyl compounds 

10 

a) Example 6 was repeated, but using, instead of n-octane, the 

substrate naphthalene. The products that were identified were 
1-naphthol and cis-1 , 2-dihydroxy-l , 2-dihydronaphthalene . 88% 
of the naphthalene starting material had been converted. 

15 

Analytic methods for reactions with naphthalene 



GC: 

Apparatus: Carlo Erba Strumentazion Typ HRGC 4160 on Column 
Injector; Column: DB5 30m x 0.2 mm; Material: 5% diphenyl- 
95% dimethylpolysiloxane ; Carrier gas: 0.5 bar H2 ; 
Temperature program: 40°C 1 min isothermic / 10°C/min to 300°C 
Rt( 1-naphthol) = 16.68 



NMR: 

1-Naphthol and cis-1 , 2-dihydroxy-l , 2-dihydro-naphthalene were 
identified in the X H NMR. 



b) Example 6 was repeated but using, instead of n-octane, the 

substrate 8-methylquinoline . 5-Hydroxy-8-methylquinoline was 
identified as main product, in addition to other derivatives 
(product ratio 5:1). 35% of the starting material used had 
been converted. 



c) Example 6 was repeated but using, instead of n-octane, the 
substrate a-ionone. 3-Hydroxy-a-ionone was identified as main 
product, in addition to other derivatives (product ratio: 
76:24). 60% of the starting material used had been converted. 

d) Example 6 was repeated, but using, instead of n-octane, the 
substrate cumene ( isopropylbenzene ) . Five monohydroxy 
products and one dihydroxy product were identified. 70% of 
the starting material used had been converted. 



45 
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We claim: . 

1. A cytochrome P450 monooxygenase which is capable of at least 
5 one of the following reactions: 

a) oxidation of optionally substituted N-, O- or 
S-heterocyclic mono- or polynuclear aromatic compounds; 

b) oxidation of optionally substituted mono- or polynuclear 
aromatics ; 

10 c) oxidation of straight-chain or branched alkanes and 

alkenes; 

d) oxidation of optionally substituted cycloalkanes and 
cycloalkenes; 



15 where the monooxygenase is derived from cytochrome P45 0 

monooxygenase BM-3 from Bacillus megaterium having an amino 
acid sequence according to SEQ ID NO: 2, which, has at least 
one functional mutation in at least one of the amino acid 
sequence regions 172-224, 39-43, 48-52, 67-70, 330-335, 

20 3 52-356, 73-82 and 86-88; except . the single mutant Phe87Val. 

2- A monooxygenase as claimed in claim 1, which has at least one 
functional mutation in at least one of the sequence regions 
73-82, 86-88 and 172-224. f, 

25 

3. A monooxygenase as claimed in claim 1, which has at least one 
of the following mono- or polyamino acid substitutions: 

a) Phe87Val, Leul88Gln; or 

b) Phe87Val, leul88Gln, Ala74Gly; 

30 and functional equivalents thereof which are capable of at 

least one of the above oxidation reactions . 

4. A nucleic acid sequence coding for a monooxygenase according 
to one of the preceding claims. 

35 

5. An expression construct comprising, under the genetic control 
of regulatory nucleic acid sequences, a coding sequence which 
comprises a nucleic acid sequence according to claim 4 . 

40 6. a vector comprising at least one expression construct 
according to claim 5. 



A recombinant microorganism transformed by at least one 
vector as claimed in claim 6. 
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8- A microorganism as claimed in claim 7, selected from bacteria 
of the genus Escherichia. 

9. A process for the microbiological oxidation of an N- or 

i S-heterocyclic mono- or polynuclear aromatic compound, which 

comprises 

al) culturing a recombinant microorganism which expresses a 
cytochrome P450 monooxygenase of bacterial origin in a 
culture medium, in the presence of an exogenous or 
i intermediately formed substrate; or 

a2 ) incubating a substrate-containing reaction medium vith a 
cytochrome P450 monooxygenase of bacterial origin; and 
b) isolating the oxidation product formed or a secondary 
product thereof from the medium. 

10. A process as claimed in claim 9, wherein the exogenous or 
intermediately formed substrate is selected from optionally 
substituted N- or S-heterocyclic mono- or polynuclear 
aromatic compounds . 

i 

11. A process as claimed in claim 9 or 10, where the 
monooxygenase is a mutant as claimed in any of claims 1 to 3 , 
including the mutant Phe37Val. 

12- A process as claimed in claim 11, where the mutant has at 
least one of the following mono- or polyamino acid 
substitutions : 

a) Phe87Val; 

b) Phe87Val, Leul88Gln; or 

c) Phe87Val, Leul88Gln, Ala74Gly. 

13. A process for microbiological oxidation of a compound as 
defined in claim lb), c) or d) , which comprises 

al) culturing a recombinant cytochrome P450-producing 
microorganism as claimed in claim 7 or 8 in a culture medium, 
in the presence of an exogenous or intermediately formed 
substrate; or 

a2) incubating a substrate-containing reaction medium with a 
cytochrome P450 monooxygenase as claimed in any of claims 1 
to 3 ; and 

b) isolating the oxidation product formed or a secondary 
product thereof from the medium; 

where the monooxygenase mutant Phe8 7Val is not excluded. 

14. A process as claimed in claim 13, wherein the exogenous or 
intermediately formed substrate is selected from: 
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a) optionally substituted mono- or polynuclear aromatics; 

b) straight -chain or branched alkanes and alkenes; 

c) optionally substituted cycloalkanes and cycloalkenes . 

. a process as claimed in claim 13 or 14, where the 

monooxygenase is a mutant as claimed in any of claims 1 to 3, 
including the mutant Phe87Val. 

. A process as claimed in claim 15, where the mutant has at 
least one of the following mono- or polyamino acid 
substitutions : 

a) Phe87Val; 

b) Phe87Val, LeulSBGln; or 

c) Phe87Val, LeulSBGln , Ala74Gly. 

A process as claimed in any of claims 9 to 16, wherein, as 
exogenous substrate, at least one compound selected from the 
groups a) to d) of compounds defined above is added to a 
medium and the oxidation is carried out by enzymatic reaction 
of the substrate-containing medium in the presence of oxygen 
at a temperature of approximately 2 0 to 4 0°C and a pH of 
approximately 6 to 9 , where the substrate-containing medium 
additionally contains an approximately 10- to 100-fold molar 
excess of reduction equivalents based on the substrate. 

A process as claimed in claim 17, wherein, as exogenous 
substrate, a compound selected from indole, n-hexane, 
n-octane, n-decane, n-dodecane, cumene, 1-methylindole, a-, (5- 
or y-ionone, acridine, naphthalene, 6-methyl- or 
8-methylquinoline, quinoline and quinaldine is employed. 

A process for the microbiological production of indigo and/or 
indirubin, which comprises 

al) culturing a recombinant microorganism which produces an 
indole-oxidizing cytochrome P450 in a culture medium, in the 
presence of exogenous or intermediately formed indole; or 
a2) incubating an indole-containing reaction medium with an 
indole-oxidi2ing cytochrome P450 monooxygenase; and 
b) isolating the oxidation product formed or a secondary 
product thereof from the medium; 

A process as claimed in claim 19, wherein the indigo and/or 
indirubin obtained, which was produced by oxidation of 
intermediately formed indole, is isolated from the medium. 
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21. A process as claimed in claim 20, wherein the indole 
oxidation is carried out by culturing the microorganisms in 
the presence of oxygen at a culturing temperature of 
approximately 20 to 4 0°C and a pH of approximately 6 to 9 . 

5 

22. A process as alaimed in claim 20 or 21, where the 
monooxygenase is a mutant as claimed in any of claims 1 to 3 
including the mutant PheB7Val. 

10 23. A process as claimed in claim 22, where the mutant has at 
least one of the following mono- or polyamino acid 
substitutions: 

a) Phe87Val; 

b) Phe87Val, Leul88Gln; or 

15 c) Phe87Val, Leul88Gln, Ala74Gly. 

24. A bioreactor comprising an enzyme as claimed in one of claims 
1 to 3 or a recombinant microorganism as claimed in one of 
claims 7 or 8 in immobilized form. 

20 

25. The use of a cytochrome P450 monooxygenase as claimed in one 
of claims 1 to 3 , of a vector as claimed in claim 6, or of a 
microorganism as claimed in claim 7 or 8 for the 

2 S microbiological oxidation of 

a) optionally substituted O- or S-heterocyclic mono- or 
polynuclear aromatic compounds; 

b) optionally substituted mono- or polynuclear aromatics: 

c) straight-chain or branched alkanes and alkenes; and/or 
3Q d) optionally substituted cycloalkanes and cycloalkenes , 

where the monooxygenase mutant Phe8 7Val is not excluded. 

26. The use of a microorganism producing indole-oxidizing 
cytochrome P450 for the preparation of indigo and/or 

,_ indirubin. 
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filing date of this application. 
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SEQUENCE LISTING 



<110> H ALTER , Bernhard 
PLEISS, Juergen 
SCHWANEBERG, Ulrich 
SCHMITT, Jutta 

<120 Novel cytochrome P450 monooxygenases and their use for the oxidation of 
organic substrates 

<130> 50915 

<140> 10/031,146 

<141> Filing date not yet granted 

<150> PCT/EP 00/07253 
<151> 2000-07-27 



<170> Word Perfect version 6.1 

<210> 1 
<211> 3150 
<212> DNA 

<213> Bacillus megaterium 

<220> 

<221> CDS 

<222> (4) . . (3150) 

<400> 1 

atg aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa 48 

Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 

1 5 10 15 

aat tta ccg tta tta aac aca gat aaa ccg gtt caa get ttg atg aaa 96 
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
20 25 30 

att gcg gat gaa tta gga gaa ate ttt aaa ttc gag gcg cct ggt cgt 144 
lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 
35 40 45 

gta acg cgc tac tta tea agt cag cgt eta att aaa gaa gca tgc gat 192 
Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 
50 55 60 

gaa tea cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 240 
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 

gat ttt gca gga gac ggg tta ttt aca age tgg acg cat gaa aaa aat 288 
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
80 85 90 95 

tgg aaa aaa gcg cat aat ate tta ctt cca age ttc agt cag cag gca 336 
Trp Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala 
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atg aaa ggc tat cat gcg atg atg gtc gat ate gec gtg cag ctt gtt 384 

Met Lys Gly Tyr His Ala Met Met Val Asp lie Ala Val Gin Leu Val 
115 120 125 

caa aag tgg gag cgt eta aat gca gat gag cat att gaa gta ccg gaa 43 2 

Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu 

130 135 140 

gac atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac 480 

Asp Met Thr Arg Leu Thr Leu Asp Thr lie Gly Leu Cys Gly Phe Asn 

145 150 155 

tat cgc ttt aac age ttt tac cga gat cag cct cat cca ttt att aca 528 

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr 

160 165 170 175 

agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca 576 

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 
180 185 190 

aat cca gac gac cca get tat gat gaa aac aag cgc cag ttt caa gaa 624 

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 

gat ate aag gtg atg aac gac eta gta gat aaa att att gca gat cgc 672 

Asp lie Lys Val Met Asn Asp Leu Val Asp Lys lie lie Ala Asp Arg 

210 215 220 

aaa gca age ggt gaa caa age gat gat tta tta acg cat atg eta aac 720 

Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 

225 230 235 

gga aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc 76 8 

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn lie Arg 

240 245 250 255 

tat caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt 816 

Tyr Gin lie lie Thr Phe Leu lie Ala Gly His Glu Thr Thr Ser Gly 
260 265 270 

ctt tta tea ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 864 

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 
275 280 285 

caa aaa gca gca gaa gaa gca gca cga gtt eta gta gat cct gtt cca 912 

Gin Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro 

290 295 300 

age tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac 960 

Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn 

305 310 315 

gaa gcg ctg cgc tta tgg cca act get cct gcg ttt tec eta tat gca 1008 

Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 

320 325 330 335 

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 1056 
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Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 

340 345 350 

gaa eta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg 

Glu Leu Met Val Leu lie Pro Gin Leu His Arg Asp Lys Thr lie Trp 

355 360 365 

gga gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt 

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 
370 375 380 

gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg 

Ala lie Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala 

385 390 395 

tgt ate ggt cag cag ttc get ctt cat gaa gca acg ctg gta ctt ggt 

Cys lie Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 

400 405 410 415 

atg atg eta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg 

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu 

420 425 430 

gat att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa 

Asp lie Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 

435 440 445 

gca aaa teg aaa aaa att ccg ctt ggc ggt att cct tea cct age act 

Ala Lys Ser Lys Lys lie Pro Leu Gly Gly lie Pro Ser Pro Ser Thr 
450 455 460 

gaa cag tct get aaa aaa gta cgc aaa aag gca gaa aac get cat aat 

Glu Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn 
465 470 475 

acg ccg ctg ctt gtg eta tac ggt tea aat atg gga aca get gaa gga 

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 

480 485 490 495 

acg gcg cgt gat tta gca gat att gca atg age aaa gga ttt gca ccg 

Thr Ala Arg Asp Leu Ala Asp lie Ala Met Ser Lys Gly Phe Ala Pro 

500 " 505 ' 510 

cag gtc gca acg ctt gat tea cac gee gga aat ctt ccg cgc gaa gga 

Gin Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 

515 520 525 

get gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac 

Ala Val Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 
530 535 540 

gca aag caa ttt gtc gac tgg tta gac caa gcg tct get gat gaa gta 

Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 

aaa ggc gtt cgc tac tec gta ttt gga tgc ggc gat aaa aac tgg get 

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 

560 565 570 575 
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act acg tat caa aaa gtg cct get ttt ate gat gaa acg ctt gec get 1776 

Thr Thr Tyr Gin Lys Val Pro Ala Phe lie Asp Glu Thr Leu Ala Ala 

580 585 590 

aaa ggg gca gaa aac ate get gac cgc ggt gaa gca gat gca age gac 1824 

Lys Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 600 605 

gac ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac 1872 

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 

gta gca gee tac ttt aac etc gac att gaa aac agt gaa gat aat aaa 1920 

Val Ala Ala Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys 
625 630 635 

tct act ctt tea ctt caa ttt gtc gac age gee gcg gat atg ccg ctt 1968 

Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 

640 645 650 655 

gcg aaa atg cac ggt gcg ttt tea acg aac gtc gta gca age aaa gaa 2016 

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 

660 665 670 

ctt caa cag cca ggc agt gca cga age acg cga cat ctt gaa att gaa 2064 

Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu He Glu 
675 680 685 

ctt cca aaa gaa get tct tat caa gaa gga gat cat tta ggt gtt att 2112 

Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He 
690 695 700 

cct cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc 2160 

Pro Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 " 715 

eta gat gca tea cag caa ate cgt ctg gaa gca gaa gaa gaa aaa tta 22 08 

Leu Asp Ala Ser Gin Gin He Arg Leu Glu Ala Glu Glu Glu Lys Leu 

720 725 730 735 

get cat ttg cca etc get aaa aca gta tec gta gaa gag ctt ctg caa 2256 

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 

740 745 750 

tac gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg 23 04 

Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 

get get aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gee ttg 2352 

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 

ctt gaa aag caa gee tac aaa gaa caa gtg ctg gca aaa cgt tta aca 2400 

Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 

atg ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc age 2448 

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 

800 805 810 815 
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gaa ttt ate gec ctt ctg cca age ata cgc ccg cgc tat tac teg att 2496 
Glu Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie 
820 825 830 

tct tea tea cct cgt gtc gat gaa aaa caa gca age ate acg gtc age 2544 
Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser 
835 840 845 

gtt gtc tea gga gaa gcg tgg age gga tat gga gaa tat aaa gga att 2592 
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie 
850 855 860 

gcg teg aac tat ctt gee gag ctg caa gaa gga gat acg att acg tgc 2640 
Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys 
865 870 875 

ttt att tec aca ccg cag tea gaa ttt acg ctg cca aaa gac cct gaa 2688 
Phe lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
880 885 890 895 

acg ccg ctt ate atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga 2736 
Thr Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 

ggc ttt gtg cag gcg cgc aaa cag eta aaa gaa caa gga cag tea ctt 2784 
Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 

gga gaa gca cat tta tac ttc ggc tgc cgt tea cct cat gaa gac tat 2832 
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 940 

ctg tat caa gaa gag ctt gaa aac gee caa age gaa ggc ate att acg 2880 
Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr 
945 950 955 

ctt cat ace get ttt tct cgc atg cca aat cag ccg aaa aca tac gtt 2 928 
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
960 965 970 975 

cag cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat 2 976 
Gin His Val Met Glu Gin Asp Gly Lys Lys Leu lie Glu Leu Leu Asp 
980 985 990 

caa gga gcg cac ttc tat att tgc gga gac gga age caa atg gca cct 3 024 
Gin Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 

gec gtt gaa gca acg ctt atg aaa age tat get gac gtt cac caa gtg 3 072 
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

agt gaa gca gac get cgc tta tgg ctg cag cag eta gaa gaa aaa ggc 312 0 
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly 
1025 1030 1035 

cga tac gca aaa gac gtg tgg get ggg taa 3150 
Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 1045 
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<210> 2 
<211> 1048 
<212> PRT 

<213> Bacillus megaterium 



<400> 2 

Thr lie Lys Glu Met Pro Gin Pro 
1 5 

Leu Pro Leu Leu Asn Thr Asp Lys 
20 



Lys Thr Phe Gly Glu Leu Lys Asn 
10 ' 15 

Pro Val Gin Ala Leu Met Lys lie 
25 30 



Ala Asp Glu Leu Gly Glu lie Phe 

35 40 

Thr Arg Tyr Leu Ser Ser Gin Arg 
50 55 

Ser Arg Phe Asp Lys Asn Leu Ser 
65 70 

Phe Ala Gly Asp Gly Leu Phe Thr 
85 

Lys Lys Ala His Asn lie Leu Leu 
100 

Lys Gly Tyr His Ala Met Met Val 
115 120 



Lys Phe Glu Ala Pro Gly Arg Val 
45 

Leu lie Lys Glu Ala Cys Asp Glu 

60 

Gin Ala Leu Lys Phe Val Arg Asp 
75 80 

Ser Trp Thr His Glu Lys Asn Trp 
90 95 

Pro Ser Phe Ser Gin Gin Ala Met 
105 110 

Asp He Ala Val Gin Leu Val Gin 
125 



Lys Trp Glu Arg Leu Asn Ala Asp 
130 135 

Met Thr Arg Leu Thr Leu Asp Thr 
145 150 

Arg Phe Asn Ser Phe Tyr Arg Asp 
165 

Met Val Arg Ala Leu Asp Glu Ala 
180 



Glu His He Glu Val Pro Glu Asp 
140 

He Gly Leu Cys Gly Phe Asn Tyr 

155 160 

Gin Pro His Pro Phe He Thr Ser 
170 175 

Met Asn Lys Leu Gin Arg Ala Asn 
185 190 



Pro Asp Asp Pro 
195 

He Lys Val Met 
210 

Ala Ser Gly Glu 
225 

Lys Asp Pro Glu 



Gin He He Thr 

260 



Ala Tyr Asp Glu 
200 

Asn Asp Leu Val 
215 

Gin Ser Asp Asp 
230 

Thr Gly Glu Pro 
245 

Phe Leu He Ala 



Asn Lys Arg Gin 



Asp Lys He He 
220 

Leu Leu Thr His 
235 

Leu Asp Asp Glu 

250 

Gly His Glu Thr 
265 



Phe Gin Glu Asp 

205 

Ala Asp Arg Lys 



Met Leu Asn Gly 
240 

Asn He Arg Tyr 
255 

Thr Ser Gly Leu 
270 
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Leu Ser Phe Ala 
275 

Lys Ala Ala Glu 
290 

Tyr Lys Gin Val 
305 

Ala Leu Arg Leu 



Glu Asp Thr Val 
340 

Leu Met Val Leu 

355 

Asp Asp Val Glu 
370 

lie Pro Gin His 
385 

He Gly Gin Gin 



Met Leu Lys His 

420 

He Lys Glu Thr 
435 

Lys Ser Lys Lys 
450 

Gin Ser Ala Lys 
465 

Pro Leu Leu Val 



Ala Arg Asp Leu 
500 

Val Ala Thr Leu 
515 

Val Leu He Val 
530 

Lys Gin Phe Val 

545 

Gly Val Arg Tyr 



Thr Tyr Gin Lys 
580 



Leu Tyr Phe Leu 
280 

Glu Ala Ala Arg 
295 

Lys Gin Leu Lys 
310 

Trp Pro Thr Ala 
325 

Leu Gly Gly Glu 



He Pro Gin Leu 
360 

Glu Phe Arg Pro 
375 

Ala Phe Lys Pro 
390 

Phe Ala Leu His 
405 

Phe Asp Phe Glu 



Leu Thr Leu Lys 
440 

He Pro Leu Gly 
455 

Lys Val Arg Lys 
470 

Leu Tyr Gly Ser 
485 

Ala Asp He Ala 



Asp Ser His Ala 
520 

Thr Ala Ser Tyr 
535 

Asp Trp Leu Asp 

550 

Ser Val Phe Gly 
565 

Val Pro Ala Phe 
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Val Lys Asn Pro 



Val Leu Val Asp 
300 

Tyr Val Gly Met 
315 

Pro Ala Phe Ser 
330 

Tyr Pro Leu Glu 

345 

His Arg Asp Lys 



Glu Arg Phe Glu 
380 

Phe Gly Asn Gly 
395 

Glu Ala Thr Leu 

410 

Asp His Thr Asn 
425 

Pro Glu Gly Phe 



Gly He Pro Ser 
460 

Lys Ala Glu Asn 
475 

Asn Met Gly Thr 
490 

Met Ser Lys Gly 
505 

Gly Asn Leu Pro 



Asn Gly His Pro 
540 

Gin Ala Ser Ala 
555 

Cys Gly Asp Lys 
570 

He Asp Glu Thr 
585 



His Val Leu Gin 
285 

Pro Val Pro Ser 



Val Leu Asn Glu 
320 

Leu Tyr Ala Lys 
335 

Lys Gly Asp Glu 
350 

Thr He Trp Gly 

365 

Asn Pro Ser Ala 



Gin Arg Ala Cys 
400 

Val Leu Gly Met 
415 

Tyr Glu Leu Asp 

430 

Val Val Lys Ala 
445 

Pro Ser Thr Glu 



Ala His Asn Thr 
480 

Ala Glu Gly Thr 
495 

Phe Ala Pro Gin 
510 

Arg Glu Gly Ala 
52 5 

Pro Asp Asn Ala 



Asp Glu Val Lys 

560 



Asn Trp Ala Thr 

575 

Leu Ala Ala Lys 
590 
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Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
595 600 605 

Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
610 ' " 615 ~ 620 

Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys Ser 
625 630 635 640 

Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 

645 650 655 

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu 
660 665 670 

Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu Leu 
675 680 685 

Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val lie Pro 
690 695 700 

Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly Leu 
705 710 715 720 

Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 
725 730 735 

His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr 
740 745 750 

Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met Ala 

755 760 765 

Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 
770 775 780 

Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr Met 
785 790 795 800 

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 
805 810 ~ 815 

Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie Ser 
820 825 830 

Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser Val 
835 840 845 

Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He Ala 
850 855 860 

Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys Phe 
865 870 875 880 

He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 
885 890 " 895 



Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly 



;i0 0 3JLJl.B4-S „ it'll ,1. .1, 7' 01 ;ei 



9 

900 

Phe Val Gin Ala Arg Lys Gin Leu 
915 920 

Glu Ala His Leu Tyr Phe Gly Cys 
930 935 

Tyr Gin Glu Glu Leu Glu Asn Ala 
945 950 

His Thr Ala Phe Ser Arg Met Pro 
965 

His Val Met Glu Gin Asp Gly Lys 

980 

Gly Ala His Phe Tyr lie Cys Gly 
995 1000 

Val Glu Ala Thr Leu Met Lys Ser 
1010 1015 

Glu Ala Asp Ala Arg Leu Trp Leu 
1025 1030 

Tyr Ala Lys Asp Val Trp Ala Gly 
1045 

<210> 3 
<211> 30 
<212> DNA 

<213> Synthetic sequence 
<220> 

<221> unsure 
<222> 1 . . 30 

<223>nisg, a, tore. 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 3 

gcaggagacg ggttgnnnac aagctggacg 3 0 

<210> 4 
<211> 30 
<212> DNA 

<213> Synthetic sequence 
<220> 

<221> unsure 
<222> 1. .30 

<223> n is g, a, t or c. 

<223> Description of the synthetic sequence: PCR primer 



905 910 

Lys Glu Gin Gly Gin Ser Leu Gly 

925 

Arg Ser Pro His Glu Asp Tyr Leu 
940 

Gin Ser Glu Gly lie lie Thr Leu 

955 960 

Asn Gin Pro Lys Thr Tyr Val Gin 
970 975 

Lys Leu lie Glu Leu Leu Asp Gin 
985 990 

Asp Gly Ser Gin Met Ala Pro Ala 

1005 

Tyr Ala Asp Val His Gin Val Ser 
1020 

Gin Gin Leu Glu Glu Lys Gly Arg 

1035 ~ 1040 



<400> 4 
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cgtccagctt gtnnncaacc cgtctcctgc 

<210> 5 
<211> 34 
<212> DNA 

<213> Synthetic sequence 
<220> 

<221> unsure 
<222> 1 . . 34 

<223> n is g, a, t or c. 

<223> Description of the synthetic sequence: PCR primer 
<400> 5 

gaagcaatga acaagnnnca gcgagcaaat ccag 

<210> 6 
<211> 30 
<212> DNA 

<213> Synthetic sequence 
<220> 

<221> unsure 
<222> 1 . . 30 

<223>nisg, a, tore. 

<223> Description of the synthetic sequence: PCR primer 
<400> 6 

ctggatttgc tegctgnnne ttgttcattg 

<210> 7 
<211> 41 
<212> DNA 

<213> Synthetic sequence 
<220> 

<221> unsure 
<222> 1 . .41 

<223> n is g, a, t or c. 

<223> Description of the synthetic sequence: PCR primer 
<400> 7 

gctttgataa aaacttaaag tcaannnctt aaatttgtac g 

<210> 8 
<211> 40 
<212> DNA 

<213> Synthetic sequence 
<220> 

<221> unsure 
<222> 1. .40 

<223> n is g, a, t or c. 

<223> Description of the synthetic sequence: PCR primer 



<400> 8 

cgtacaaatt taagnnnttg acttaagttt ttatcaaagc 

<210> 9 
<211> 1049 
<212> PRT 

<213> Bacillus megaterium 
<400> 9 

Met Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 
1 5 10 15 

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
2 0 " 25 3 0 

lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 

3 5 4 0 4 5 

Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 
50 55 60 

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 80 

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
85 90 95 

Trp Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala 
100 105 110 

Met Lys Gly Tyr His Ala Met Met Val Asp lie Ala Val Gin Leu Val 
115 120 125 

Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His He Glu Val Pro Glu 
130 135 140 

Asp Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn 
145 150 155 ' 160 

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe He Thr 
165 170 175 

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 
180 185 190 

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 

Asp He Lys Val Met Asn Asp Leu Val Asp Lys He He Ala Asp Arg 
210 215 220 

Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 
225 230 235 240 

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg 
245 250 255 
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Tyr Gin He He 

260 

Leu Leu Ser Phe 
275 

Gin Lys Ala Ala 
290 

Ser Tyr Lys Gin 
305 

Glu Ala Leu Arg 



Lys Glu Asp Thr 
340 

Glu Leu Met Val 
355 

Gly Asp Asp Val 
370 

Ala He Pro Gin 
385 

Cys He Gly Gin 



Met Met Leu Lys 
420 

Asp He Lys Glu 
435 

Ala Lys Ser Lys 
450 

Glu Gin Ser Ala 
465 

Thr Pro Leu Leu 



Thr Ala Arg Asp 
500 

Gin Val Ala Thr 
515 

Ala Val Leu He 
530 

Ala Lys Gin Phe 
545 

Lys Gly Val Arg 



Thr Phe Leu He 



Ala Leu Tyr Phe 
280 

Glu Glu Ala Ala 
295 

Val Lys Gin Leu 
310 

Leu Trp Pro Thr 

325 

Val Leu Gly Gly 



Leu He Pro Gin 
360 

Glu Glu Phe Arg 
375 

His Ala Phe Lys 

390 

Gin Phe Ala Leu 
405 

His Phe Asp Phe 



Thr Leu Thr Leu 
440 

Lys He Pro Leu 
455 

Lys Lys Val Arg 
470 

Val Leu Tyr Gly 
485 

Leu Ala Asp He 



Leu Asp Ser His 
520 

Val Thr Ala Ser 
535 

Val Asp Trp Leu 
550 

Tyr Ser Val Phe 
565 



Ala Gly His Glu 

265 

Leu Val Lys Asn 



Arg Val Leu Val 
300 

Lys Tyr Val Gly 
315 

Ala Pro Ala Phe 

330 

Glu Tyr Pro Leu 
345 

Leu His Arg Asp 



Pro Glu Arg Phe 
380 

Pro Phe Gly Asn 
395 

His Glu Ala Thr 
410 

Glu Asp His Thr 
425 

Lys Pro Glu Gly 



Gly Gly He Pro 
460 

Lys Lys Ala Glu 
475 

Ser Asn Met Gly 
490 

Ala Met Ser Lys 
505 

Ala Gly Asn Leu 



Tyr Asn Gly His 
540 

Asp Gin Ala Ser 
555 

Gly Cys Gly Asp 
570 



Thr Thr Ser Gly 
270 

Pro His Val Leu 
285 

Asp Pro Val Pro 



Met Val Leu Asn 
320 

Ser Leu Tyr Ala 
335 

Glu Lys Gly Asp 
350 

Lys Thr He Trp 
365 

Glu Asn Pro Ser 



Gly Gin Arg Ala 
400 

Leu Val Leu Gly 
415 

Asn Tyr Glu Leu 
430 

Phe Val Val Lys 
445 

Ser Pro Ser Thr 



Asn Ala His Asn 
480 

Thr Ala Glu Gly 
495 

Gly Phe Ala Pro 
510 

Pro Arg Glu Gly 
525 

Pro Pro Asp Asn 



Ala Asp Glu Val 
560 

Lys Asn Trp Ala 
575 
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Thr Thr Tyr Gin Lys Val Pro Ala Phe lie Asp Glu Thr Leu Ala Ala 
580 585 590 

Lys Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 600 605 

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 

Val Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys 
625 630 635 ~~ 640 

Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 
645 650 655 

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
660 665 670 

Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu 
675 680 685 

Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val lie 
690 695 700 

Pro Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 720 

Leu Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu 
725 730 735 

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 

Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 

Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 800 

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
805 810 815 

Glu Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie 

820 825 " 830 

Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser 
835 840 845 

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie 
850 855 860 

Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys 
865 870 875 880 

Phe lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
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885 890 895 

Thr Pro Leu He Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 

Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 940 

Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly He He Thr 
945 950 955 ' 960 

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
965 970 975 

Gin His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp 
980 985 990 

Gin Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly 
1025 1030 1035 1040 

Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1045 
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SEQUENCE LISTING 

<110> BASF Aktiengesellschaf t 

<120> Novel cytochrome P450 monooxygenases and their use for the 
oxidation of organic substrates 

<130> M/40241 

<140> 
<141> 

<160> 9 

<170> Patentln Ver . 2.1 

<210> 1 

<211> 3150 

<212> DNA 

<213> Bacillus megaterium 

<220> 

<221> CDS 

<222> (4). .(3150) 

<400> 1 

atg aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa 4 8 

Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 
15 10 15 

aat tta ccg tta tta aac aca gat aaa ccg gtt caa get ttg atg aaa 96 
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
20 25 30 

att gcg gat gaa tta gga gaa ate ttt aaa ttc gag gcg cct ggt cgt 144 
lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 
35 40 45 

gta acg cgc tac tta tea agt cag cgt eta att aaa gaa gca tgc gat 192 
Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 
50 55 60 

gaa tea cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 240 
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
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gat ttt gca gga gac ggg tta ttt aca age tgg acg cat gaa aaa aat 288 
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
80 85 90 95 



tgg aaa aaa gcg cat aat ate tta ctt cca age ttc agt cag cag gca 336 
Trp Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala 
100 105 110 



atg aaa ggc tat cat gcg atg atg gtc gat ate gee gtg cag ctt gtt 3 84 
Met Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val 
115 120 125 



caa aag tgg gag cgt eta aat gca gat gag cat att gaa gta ccg gaa 4 32 
Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His He Glu Val Pro Glu 
130 135 140 



gac atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac 480 
Asp Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn 
145 150 155 



tat cgc ttt aac age ttt tac cga gat cag cct cat cca ttt att aca 52 8 
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr 
160 165 170 175 



agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca 576 
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 
180 185 190 



aat cca gac gac cca get tat gat gaa aac aag cgc cag ttt caa gaa 624 
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 



gat ate aag gtg atg aac gac eta gta gat aaa att att gca gat cgc 6 72 
Asp lie Lys Val Met Asn Asp Leu Val Asp Lys lie lie Ala Asp Arg 
210 215 220 



aaa gca age ggt gaa caa age gat gat tta tta acg cat atg eta aac 72 0 
Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 
225 230 235 



gga aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc 7 68 
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn lie Arg 
240 245 250 255 



tat caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt 816 
Tyr Gin lie lie Thr Phe Leu lie Ala Gly His Glu Thr Thr Ser Gly 
260 265 270 



ctt tta tea ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 
275 280 285 



:!!., o o 3 U & ,,. O :.t. ' 

0050/50915 



caa aaa gca gca gaa gaa gca gca cga gtt eta gta gat cct gtt cca 
Gin Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro 
290 295 300 

age tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac 
Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn 
305 310 315 

gaa gcg ctg cgc tta tgg cca act get cct gcg ttt tec eta tat gca 
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 
320 325 330 335 

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 
340 345 350 

gaa eta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg 
Glu Leu Met Val Leu lie Pro Gin Leu His Arg Asp Lys Thr lie Trp 
355 360 365 

gga gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt 
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 
370 375 380 

gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg 
Ala lie Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala 
385 390 395 

tgt ate ggt cag cag ttc get ctt cat gaa gca acg ctg gta ctt ggt 
Cys lie Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 
400 405 410 415 

atg atg eta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg 
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu 
420 425 430 

gat att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa 
Asp lie Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 
435 440 445 

gca aaa teg aaa aaa att ccg ctt ggc ggt att cct tea cct age act 
Ala Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr 
450 455 460 

gaa cag tct get aaa aaa gta cgc aaa aag gca gaa aac get cat aat 
Glu Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn 
465 470 475 



acg ccg ctg ctt gtg eta tac ggt tea aat atg gga aca get gaa gga 
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 
480 485 490 495 



1488 
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acg gcg cgt gat tta gca gat att gca atg age aaa gga ttt gca ccg 
Thr Ala Arg Asp Leu Ala Asp lie Ala Met Ser Lys Gly Phe Ala Pro 
500 505 510 

cag gtc gca acg ctt gat tea cac gec gga aat ctt ccg cgc gaa gga 
Gin Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 
515 520 525 

get gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac 
Ala Val Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 
530 535 540 

gca aag caa ttt gtc gac tgg tta gac caa gcg tct get gat gaa gta 
Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 

aaa ggc gtt cgc tac tec gta ttt gga tgc ggc gat aaa aac tgg get 
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 
560 565 570 575 

act acg tat caa aaa gtg cct get ttt ate gat gaa acg ctt gee get 
Thr Thr Tyr Gin Lys Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala 
580 585 590 

aaa ggg gca gaa aac ate get gac cgc ggt gaa gca gat gca age gac 
Lys Gly Ala Glu Asn He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 600 605 

gac ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac 
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 

gta gca gee tac ttt aac etc gac att gaa aac agt gaa gat aat aaa 
Val Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys 
625 630 635 

tct act ctt tea ctt caa ttt gtc gac age gee gcg gat atg ccg ctt 
Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 
640 645 650 655 

gcg aaa atg cac ggt gcg ttt tea acg aac gtc gta gca age aaa gaa 
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
660 665 670 

ctt caa cag cca ggc agt gca cga age acg cga cat ctt gaa att gaa 
Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu He Glu 
675 680 685 

ctt cca aaa gaa get tct tat caa gaa gga gat cat tta ggt gtt att 
Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He 
690 695 700 
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cct cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc 
Pro Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 

eta gat gca tea cag caa ate cgt ctg gaa gca gaa gaa gaa aaa tta 
Leu Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu 
720 725 730 735 

get cat ttg cca etc get aaa aca gta tec gta gaa gag ctt ctg caa 
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 

tac gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg 
Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 

get get aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gec ttg 
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 

ctt gaa aag caa gec tac aaa gaa caa gtg ctg gca aaa cgt tta aca 
Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 

atg ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc age 
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
800 805 810 815 

gaa ttt ate gee ctt ctg cca age ata cgc ccg cgc tat tac teg att 
Glu Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie 
820 825 830 

tct tea tea cct cgt gtc gat gaa aaa caa gca age ate acg gtc age 
Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser 
835 840 845 

gtt gtc tea gga gaa gcg tgg age gga tat gga gaa tat aaa gga att 
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie 
850 855 860 

gcg teg aac tat ctt gee gag ctg caa gaa gga gat acg att acg tgc 
Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys 
865 870 875 

ttt att tec aca ccg cag tea gaa ttt acg ctg cca aaa gac cct gaa 
Phe lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
880 885 890 895 

acg ccg ctt ate atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga 
Thr Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 
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ggc ttt gtg cag gcg cgc aaa cag eta aaa gaa caa gga cag tea ctt 2784 
Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 • 925 

gga gaa gca cat tta tac ttc ggc tgc cgt tea cct cat gaa gac tat 2832 
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 940 

ctg tat caa gaa gag ctt gaa aac gec caa age gaa ggc ate att acg 2880 
Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr 
945 950 955 

ctt cat acc get ttt tct cgc atg cca aat cag ccg aaa aca tac gtt 2928 
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
960 965 970 975 

cag cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat 29 76 
Gin His Val Met Glu Gin Asp Gly Lys Lys Leu lie Glu Leu Leu Asp 
980 985 990 

caa gga gcg cac ttc tat att tgc gga gac gga age caa atg gca cct 3024 
Gin Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 

gec gtt gaa gca acg ctt atg aaa age tat get gac gtt cac caa gtg 3072 
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

agt gaa gca gac get cgc tta tgg ctg cag cag eta gaa gaa aaa ggc 3120 
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly 
1025 1030 1035 

cga tac gca aaa gac gtg tgg get ggg taa 3150 
Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 1045 

<210> 2 
<211> 1048 
<212> PRT 

<213> Bacillus megaterium 
<400> 2 

Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys Asn 
15 10 15 

Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys lie 
20 25 30 



Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg Val 
35 40 45 
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Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp Glu 
50 55 60 

Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg Asp 



Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp 



Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala Met 
100 105 110 

Lys Gly Tyr His Ala Met Met Val Asp lie Ala Val Gin Leu Val Gin 
115 120 125 

Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu Asp 
130 135 140 

Met Thr Arg Leu Thr Leu Asp Thr lie Gly Leu Cys Gly Phe Asn Tyr 
145 150 155 160 

Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr Ser 
165 170 175 

Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala Asn 
180 185 190 

Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu Asp 
195 200 205 

lie Lys Val Met Asn Asp Leu Val Asp Lys lie lie Ala Asp Arg Lys 
210 215 220 

Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 
225 230 235 240 

Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn lie Arg Tyr 
245 250 255 

Gin He He Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly Leu 
260 265 270 

Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin 
275 280 285 

Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
290 295 300 



Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305 310 315 320 
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Ala Leu Arg Leu 



Glu Asp Thr Val 
340 

Leu Met Val Leu 

355 

Asp Asp Val Glu 
370 

lie Pro Gin His 
385 

He Gly Gin Gin 



Met Leu Lys His 
420 

He Lys Glu Thr 
435 

Lys Ser Lys Lys 
450 

Gin Ser Ala Lys 
465 

Pro Leu Leu Val 



Ala Arg Asp Leu 
500 

Val Ala Thr Leu 

515 

Val Leu He Val 
530 

Lys Gin Phe Val 
545 

Gly Val Arg Tyr 



Thr Tyr Gin Lys 
580 



Trp Pro Thr Ala 

325 

Leu Gly Gly Glu 



lie Pro Gin Leu 
360 

Glu Phe Arg Pro 
375 

Ala Phe Lys Pro 
390 

Phe Ala Leu His 
405 

Phe Asp Phe Glu 



Leu Thr Leu Lys 
440 

He Pro Leu Gly 
455 

Lys Val Arg Lys 
470 

Leu Tyr Gly Ser 
485 

Ala Asp He Ala 



Asp Ser His Ala 
520 

Thr Ala Ser Tyr 
535 

Asp Trp Leu Asp 
550 

Ser Val Phe Gly 
565 

Val Pro Ala Phe 
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Pro Ala Phe Ser 
330 

Tyr Pro Leu Glu 
345 

His Arg Asp Lys 



Glu Arg Phe Glu 
380 

Phe Gly Asn Gly 
395 

Glu Ala Thr Leu 
410 

Asp His Thr Asn 

425 

Pro Glu Gly Phe 



Gly He Pro Ser 
460 

Lys Ala Glu Asn 
475 

Asn Met Gly Thr 
490 

Met Ser Lys Gly 
505 

Gly Asn Leu Pro 



Asn Gly His Pro 
540 

Gin Ala Ser Ala 
555 

Cys Gly Asp Lys 
570 

He Asp Glu Thr 

585 



Leu Tyr Ala Lys 
335 

Lys Gly Asp Glu 
350 

Thr He Trp Gly 
365 

Asn Pro Ser Ala 



Gin Arg Ala Cys 
400 

Val Leu Gly Met 
415 

Tyr Glu Leu Asp 
430 

Val Val Lys Ala 
445 

Pro Ser Thr Glu 



Ala His Asn Thr 
480 

Ala Glu Gly Thr 
495 

Phe Ala Pro Gin 
510 

Arg Glu Gly Ala 

525 

Pro Asp Asn Ala 



Asp Glu Val Lys 
560 

Asn Trp Ala Thr 
575 

Leu Ala Ala Lys 

590 
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Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 



Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
610 615 620 

Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys Ser 
625 630 635 640 

Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 
645 650 655 

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu 
660 665 670 

Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu Leu 
675 680 685 

Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val lie Pro 
690 695 700 

Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly Leu 
705 710 715 720 

Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 
725 730 735 

His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr 
740 745 750 

Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met Ala 
755 760 765 

Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 
770 775 780 

Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr Met 
785 790 795 800 

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 
805 810 815 

Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie Ser 
820 825 830 

Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser Val 
835 840 845 



Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie Ala 
850 855 860 
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Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys Phe 
865 870 875 880 

lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 
885 890 895 

Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly 
900 905 910 

Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu Gly 
915 920 925 

Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
930 935 940 

Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr Leu 
945 950 955 960 

His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val Gin 
965 970 975 

His Val Met Glu Gin Asp Gly Lys Lys Leu lie Glu Leu Leu Asp Gin 
980 985 990 

Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro Ala 
995 1000 1005 

Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val Ser 
1010 1015 1020 

Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly Arg 
1025 1030 1035 1040 

Tyr Ala Lys Asp Val Trp Ala Gly 
1045 

<210> 3 
<211> 30 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 3 

gcaggagacg ggttgnnnac aagctggacg 3 0 

<210> 4 
<211> 30 
<212> DNA 

<213> Synthetic sequence 
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<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 4 

cgtccagctt gtnnncaacc cgtctcctgc 

<210> 5 
<211> 34 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 5 

gaagcaatga acaagnnnca gcgagcaaat ccag 

<210> 6 

<211> 30 

<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 

<400> 6 

ctggatttgc tcgctgnnnc ttgttcattg 

<210> 7 
<211> 41 
<212> DNA 

<213> Synthetic sequence 
<220> 

<2 23> Description of the synthetic sequence: PCR primer 
<400> 7 

gctttgataa aaacttaaag tcaannnctt aaatttgtac g 

<210> 8 
<211> 40 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 8 

cgtacaaatt taagnnnttg acttaagttt ttatcaaagc 
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<210> 9 
<211> 1049 
<212> PRT 

<213> Bacillus megaterium 
<400> 9 

Met Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 



Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
20 25 30 

lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 



Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 
50 55 60 

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 80 

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
85 90 95 

Trp Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala 
100 105 110 

Met Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val 
115 120 125 

Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His He Glu Val Pro Glu 
130 135 140 

Asp Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn 
145 150 155 160 

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe He Thr 
165 170 175 

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 
180 185 190 

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 

Asp He Lys Val Met Asn Asp Leu Val Asp Lys He He Ala Asp Arg 
210 215 220 



Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 
225 230 235 240 
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Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn lie Arg 



Tyr Gin He He Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly 
260 265 270 

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 
275 280 285 

Gin Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro 
290 295 300 

Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn 
305 310 315 320 

Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 
325 330 335 

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 
340 345 350 

Glu Leu Met Val Leu lie Pro Gin Leu His Arg Asp Lys Thr lie Trp 
355 360 365 

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 
370 375 380 

Ala He Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala 
385 390 395 400 

Cys He Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 
405 410 415 

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu 
420 425 430 

Asp He Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 
435 440 445 

Ala Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr 
450 455 460 

Glu Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn 
465 470 475 480 

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 
485 490 495 



Thr Ala Arg Asp Leu Ala Asp He Ala Met Ser Lys Gly Phe Ala Pro 
500 505 510 
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Gin Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 
515 520 525 

Ala Val Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 
530 535 540 

Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 560 

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 
565 570 575 

Thr Thr Tyr Gin Lys Val Pro Ala Phe lie Asp Glu Thr Leu Ala Ala 
580 585 590 

Lys Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 600 605 

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 

Val Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys 
625 630 635 640 

Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 
645 650 655 

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
660 665 670 

Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu 
675 680 685 

Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val lie 
690 695 700 

Pro Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 720 

Leu Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu 
725 730 735 

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 

Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 



Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 
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Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 800 

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
805 810 815 

Glu Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie 
820 825 830 

Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser 
835 840 845 

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie 
850 855 860 

Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys 
865 870 875 880 

Phe lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
885 890 895 

Thr Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 

Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 940 

Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr 
945 950 955 960 

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
965 970 975 

Gin His Val Met Glu Gin Asp Gly Lys Lys Leu lie Glu Leu Leu Asp 
980 985 990 

Gin Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly 
1025 1030 1035 1040 



Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1045 



